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METHODS FOR SYNTHESIS OF ENCODED LIBRARIES 

Related Applications 

This application claims priority to U.S. Provisional Patent Application Serial 
No. 60/530854 , filed on December 17, 2003; U.S. Provisional Patent Application Serial 
No. 60/540681, filed on January 30, 2004; U.S. Provisional Patent Application Serial 
No. 60/553,715 filed March 15, 2004; and U.S. Provisional Patent Application Serial 
No. 60/588,672 filed July 16, 2004, the entire contents of each of which are incorporated 
herein by reference. 

Background of the invention 

The search for more efficient methods of identifying compounds having useful 
biological activities has led to the development of methods for screening vast numbers 
of distinct compounds, present in collections referred to as combinatorial libraries. Such 
libraries can include 10 5 or more distinct compounds. A variety of methods exist for 
producing combinatorial libraries, and combinatorial syntheses of peptides, 
peptidomimetics and small organic molecules have been reported. 

The two major challenges in the use of combinatorial approaches in drug 
discovery are the synthesis of libraries of sufficient complexity and the identification of 
molecules which are active in the screens used. It is generally acknowledged that 
greater the degree of complexity of a library, i.e., the number of distinct structures 
present in the library, the greater the probability that the library contains molecules with 
the activity of interest. Therefore, the chemistry employed in library synthesis must be 
capable of producing vast numbers of compounds within a reasonable time frame. 
However, for a given formal or overall concentration, increasing the number of distinct 
members within the library lowers the concentration of any particular library member. 
This complicates the identification of active molecules from high complexity libraries. 

One approach to overcoming these obstacles has been the development of 
encoded libraries, and particularly libraries in which each compound includes an 
amplifiable tag. Such libraries include DNA-encoded libraries, in which a DNA tag 
identifying a library member can be amplified using techniques of molecular biology, 
such as the polymerase chain reaction. However, the use of such methods for producing 
very large libraries is yet to be demonstrated, and it is clear that improved methods for 
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producing such libraries are required for the realization of the potential of this approach 
to drug discovery. 

Summary of the invention 

The present invention provides a method of synthesizing libraries of molecules 
which include an encoding oligonucleotide tag. The method utilizes a "split and pool" 
strategy in which a solution comprising an initiator, comprising a first building block 
linked to an encoding oligonucleotide, is divided ("split") into multiple fractions. In 
each fraction, the initiator is reacted with a second, unique, building block and a second, 
unique oligonucleotide which identifies the second building block. These reactions can 
be simultaneous or sequential and, if sequential, either reaction can precede the other. 
The dimeric molecules produced in each of the fractions are combined ("pooled") and 
then divided again into multiple fractions. Each of these fractions is then reacted with a 
third unique (fraction-specific) building block and a third unique oligonucleotide which 
encodes the building block. The number of unique molecules present in the product 
library is a function of (1) the number of different building blocks used at each step of 
the synthesis, and (2) the number of times the pooling and dividing process is repeated. 

In one embodiment, the invention provides a method of synthesizing a molecule 
comprising or consisting of a functional moiety which is operatively linked to an 
encoding oligonucleotide. The method includes the steps of: (1) providing an initiator 
compound consisting of a functional moiety comprising n building blocks, where n is an 
integer of 1 or greater, wherein the functional moiety comprises at least one reactive 
group and wherein the functional moiety is operatively linked to an initial 
oligonucleotide; (2) reacting the initiator compound with a building block comprising at 
least one complementary reactive group, wherein the at least one complementary 
reactive group is complementary to the reactive group of step (1), under suitable 
conditions for reaction of the reactive group and the complementary reactive group to 
form a covalent bond; (3) reacting the initial oligonucleotide with an incoming 
oligonucleotide which identifies the building block of step (b) in the presence of an 
enzyme which catalyzes ligation of the initial oligonucleotide and the incoming 
oligonucleotide, under conditions suitable for ligation of the incoming oligonucleotide 
and the initial oligonucleotide, thereby producing a molecule which comprises or 
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consists of a functional moiety comprising n+1 building blocks which is operatively 
linked to an encoding oligonucleotide. If the functional moiety of step (3) comprises a 
reactive group, steps 1 -3 can repeated one or more times, thereby forming cycles 1 to i, 
where i is an integer of 2 or greater, with the product of step (3) of a cycle s, where s is 
an integer of i-1 or less, becoming the initiator compound of cycle s + 1. 

In one embodiment, the invention provides a method of synthesizing a library of 
compounds, wherein the compounds comprise a functional moiety comprising two or 
more building blocks which is operatively linked to an oligonucleotide which identifies 
the structure of the functional moiety. The method comprises the steps of (1) providing 
a solution comprising m initiator compounds, wherein m is an integer of 1 or greater, 
where the initiator compounds consist of a functional moiety comprising n building 
blocks, where n is an integer of 1 or greater, which is operatively linked to an initial 
oligonucleotide which identifies the n building blocks; (2) dividing the solution of step 
(1) into r fractions, wherein r is an integer of 2 or greater; (3) reacting the initiator 
compounds in each fraction with one of r building blocks, thereby producing r fractions 
. comprising compounds consisting of a functional moiety comprising n+1 building . 
blocks operatively linked to the initial oligonucleotide; (4) reacting the initial 
oligonucleotide in each fraction with one of a set of r distinct incoming oligonucleotides 
in the presence of an enzyme which catalyzes the ligation of the incoming 
oligonucleotide and the initial oligonucleotide, under conditions suitable for enzymatic 
ligation of the incoming oligonucleotide and the initial oligonucleotide, thereby 
producing r aliquots comprising molecules consisting of a functional moiety comprising 
n+1 building blocks operatively linked to an elongated oligonucleotide which encodes 
the n+1 building blocks. Optionally, the method can further include the step of (5) 
recombining the r fractions produced in step (4), thereby producing a solution 
comprising compounds consisting of a functional moiety comprising n+1 building 
blocks, which is operatively linked to an elongated oligonucleotide. Steps (1) to (5) can 
be conducted one or more times to yield cycles 1 to i, where i is an integer of 2 or 
greater. In cycle s+1, where s is an integer of i-1 or less, the solution comprising m 
initiator compounds of step (1) is the solution of step (5) of cycle s. Likewise, the 
initiator compounds of step (1) of cycle s+1 are the compounds of step (5) of cycle s. 

In a preferred embodiment, the building blocks are coupled in each step using 
conventional chemical reactions. The building blocks can be coupled to produce linear 
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or branched polymers or oligomers, such as peptides, peptidomimetics, and peptoids, or 
non-oligomeric molecules, such as molecules comprising a scaffold structure to which is 
attached one or more additional chemical moieties.. For example, if the building blocks 
are amino acid residues, the building blocks can be coupled using standard peptide 
synthesis strategies, such as solution-phase or solid phase synthesis using suitable 
protection/deprotection strategies as are known in the field. Preferably, the building 
blocks are coupled using solution phase chemistry. The encoding oligonucleotides are 
single stranded or double stranded oligonucleotides, preferably double-stranded 
oligonucleotides. The encoding oligonucleotides are preferably oligonucleotides of 4 to 
12 bases or base pairs per building block; the encoding oligonucleotides can be coupled 
using standard solution phase or solid phase oligonucleotide synthetic methodology, but 
are preferably coupled using a solution phase enzymatic process. For example, the 
oligonucleotides can be coupled using a topoisomerase, a ligase, or a DNA polymerase, 
if the sequence of the encoding oligonucleotides includes an initiation sequence for 
ligation by one of these enzymes. Enzymatic coupling of the encoding oligonucleotides 
offers the advantages of (1) greater accuracy of addition compared to standard synthetic 
(non-enzymatic) coupling; and (2) the use of a simpler protection/deprotection strategy. 
In another aspect, the invention provides compounds of Formula I: 




where X is a functional moiety comprising one or more building blocks; Z is an 
oligonucleotide attached at its 3' terminus to B; Y is an oligonucleotide which is 
attached at its 5' terminus to C; A is a functional group that forms a covalent bond with 
X; B is a functional group that forms a bond with the 3 '-end of Z; C is a functional 
group that forms a bond with the 5 '-end of Y; D, F and E are each, independently, a 
bifunctional linking group; and S an atom or a molecular scaffold. Such compounds 
include those which are synthesized using the methods of the invention. 
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The invention further relates to a compound library comprising compounds 
comprising a functional moiety comprising two or more building blocks which is 
operatively linked to an oligonucleotide which encodes the structure of the functional 
moiety. Such libraries can comprise from about 10 2 to about 10 12 or more distinct 
members, for example, 10 2 , 10 3 , 10 4 , 10 5 , 10 6 , 10 7 , 10 8 , 10 9 , 10 10 , 10 11 , 10 12 or more 
distinct members, i.e., distinct molecular structures. 

In one embodiment, the compound library comprises compounds which are each 
independently of Formula I: 




where X is a functional moiety comprising one or more building blocks; Z is an 
oligonucleotide attached at its 3' terminus to B; Y is an oligonucleotide which is 
attached at its 5' terminus to C; A is a functional group that forms a covalent bond with 
X; B is a functional group that forms a bond with the 3 '-end of Z; C is a functional 
group that forms a bond with the 5 '-end of Y; D, F and E are each, independently, a 
bifiinctional linking group; and S an atom or a molecular scaffold. Such libraries 
include those which are synthesized using the methods of the invention. 

In another aspect, the invention provides a method for identifying a compound 
which binds to a biological target, said method comprising the steps of: (a) contacting the 
biological target with a compound library of the invention, where the compound library 
includes compounds which comprise a functional moiety comprising two or more 
building blocks which is operatively linked to an oligonucleotide which encodes the 
structure of the functional moiety. This step is conducted under conditions suitable for 
at least one member of the compound library to bind to the target; (2) removing library 
members that do not bind to the target; (3) amplifying the encoding oligonucleotides of 
the at least one member of the compound library which binds to the target; (4) 
sequencing the encoding oligonucleotides of step (3); and using the sequences 
determined in step (5) to determine the structure of the functional moieties of the 
members of the compound library which bind to the biological target. 
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The present invention provides several advantages in the identification of 
molecules having a desired property. For example, the methods of the invention allow 
the use of a range of chemical reactions for constructing the molecules in the presence of 
the oligonucleotide tag. The methods of the invention also provide a high-fidelity means 
of incorporating oligonucleotide tags into the chemical structures so produced. Further, 
they enable the synthesis of libraries having a large number of copies of each member, 
thereby allowing multiple rounds of selection against a biological target while leaving a 
sufficient number of molecules following the final round for amplification and sequence 
of the oligonucleotide tags. 

Brief description of the drawings 

Figure 1 is a schematic representation of ligation of double stranded 
oligonucleotides, in which the initial oligonucleotide has an overhang which is 
complementary to the overhang of the incoming oligonucleotide. The initial strand is 
represented as either free, conjugated to an aminohexyl linker or conjugated to a 
phenylalanine residue via an aminohexyl linker. 

Figure 2 is a schematic representation of oligonucleotide ligation using a splint 
strand. In this embodiment, the splint is a 12-mer oligonucleotide with sequences 
complementary to the single-stranded initial oligonucleotide and the single-stranded 
incoming oligonucleotide. 

Figure 3 is a schematic representation of ligation of an initial oligonucleotide and 
an incoming oligonucleotide, when the initial oligonucleotide is double-stranded with 
covalently linked strands, and the incoming oligonucleotide is double-stranded. 

Figure 4 is a schematic representation of oligonucleotide elongation using a 
polymerase. The initial strand is represented as either free, conjugated to an aminohexyl 
linker or conjugated to a phenylalanine residue via an aminohexyl linker. 

Figure 5 is a schematic representation of the synthesis cycle of one embodiment 
of the invention. 

Figure 6 is a schematic representation of a multiple round selection process using 
the libraries of the invention. 

Figure 7 is a gel resulting from electrophoresis of the products of each of cycles 
1 to 5 described in Example 1 and following ligation of the closing primer. Molecular 
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weight standards are shown in lane 1, and the indicated quantities of a hyperladder, for 
DNA quantitation, are shown in lanes 9 to 12. 

Figure 8 is a schematic depiction of the coupling of building blocks using azide- 
alkyne cycloaddition. 

Figures 9 and 10 illustrate the coupling of building blocks via nucleophilic 
aromatic substitution on a chlorinated triazine. 

Figure 1 1 shows representative chlorinated hetero aromatic structures suitable for 
use in the synthesis of functional moieties. 

Figure 12 illustrates the cyclization of a linear peptide using the azide/alkyne 
cycloaddition reaction. 

Figure 13a is a chromatogram of the library produced as described in Example 2 
follwing Cycle 4. 

Figure 13b is a mass spectrum of the library produced as described in Example 2 
following Cycle 4. 

Detailed description of the invention 

The present invention relates to methods of producing compounds and 
combinatorial compound libraries, the compounds and libraries produced via the 
methods of the invention, and methods of using the libraries to identify compounds 
having a desired property, such as a desired biological activity. The invention further 
relates to the compounds identified using these methods. 

A variety of approaches have been taken to produce and screen combinatorial 
chemical libraries. Examples include methods in which the individual members of the 
library are physically separated from each other, such as when a single compound is 
synthesized in each of a multitude of reaction vessels. However, these libraries are 
typically screened one compound at a time, or at most, several compounds at a time and 
do not, therefore, result in the most efficient screening process. In other methods, 
compounds are synthesized on solid supports. Such solid supports include chips in 
which specific compounds occupy specific regions of the chip or membrane ("position 
addressable"). In other methods, compounds are synthesized on beads, with each bead 
containing a different chemical structure. 
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Two difficulties that arise in screening large libraries are (1) the number of 
distinct compounds that can be screened; and (2) the identification of compounds which 
are active in the screen. In one method, the compounds which are active in the screen 
are identified by narrowing the original library into ever smaller fractions and 
subfractions, in each case selecting the fraction or subtraction which contains active 
compounds and further subdividing until attaining an active sub fraction which contains a 
set of compounds which is sufficiently small that all members of the subset can be 
individually synthesized and assessed for the desired activity. This is a tedious and time 
consuming activity. 

Another method of deconvoluting the results of a combinatorial library screen is 
to utilize libraries in which the library members are tagged with an identifying label, that 
is, each label present in the library is associated with a discreet compound structure 
present in the library, such that identification of the label tells the structure of the tagged 
molecule. One approach to tagged libraries utilizes oligonucleotide tags, as described, 
for example, in US Patent Nos. 5,573,905; 5,708,153; 5,723,598, 6,060,596 published 
PCT applications WO 93/06121; WO 93/20242; WO 94/13623; WO 00/23458; WO r ? 
02/074929 and WO 02/103008, and by Brenner and Lerner (Proc. Natl Acad. Set USA 
89, 5381-5383 (1992); Nielsen and Janda (Methods: A Companion to Methods in 
Enzymology 6, 361-371 (1994); and Nielsen, Brenner and Janda (J. Am. Chem. Soc. 115, 
9812-9813 (1993)), each of which is incorporated herein by reference in its entirety. 
Such tags can be amplified, using for example, polymerase chain reaction, to produce 
many copies of the tag and identify the tag by sequencing. The sequence of the tag then 
identifies the structure of the binding molecule, which can be synthesized in pure form 
and tested. To date, there has been no report of the use of the methodology disclosed 
by Lerner et al. to prepare large libraries. The present invention provides an 
improvement in methods to produce DNA-encoded libraries, as well as the first 
examples of large (10 5 members or greater) libraries of DNA-encoded molecules in 
which the functional moiety is synthesized using solution phase synthetic methods. 

The present invention provides methods which enable facile synthesis of 
oligonucleotide-encoded combinatorial libraries, and permit an efficient, high-fidelity 
means of adding such an oligonucleotide tag to each member of a vast collection of 
molecules. 
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The methods of the invention include methods for synthesizing biftinctional 
molecules which comprise a first moiety ("functional moiety") which is made up of 
building blocks, and a second moiety operatively linked to the first moiety, comprising 
an oligonucleotide tag which identifies the structure of the first moiety, i.e., the 
oligonucleotide tag indicates which building blocks were used in the construction of the 
first moiety, as well as the order in which the building blocks were linked. Generally, 
the information provided by the oligonucleotide tag is sufficient to determine the 
building blocks used to construct the active moiety. In certain embodiments, the 
sequence of the oligonucleotide tag is sufficient to determine the arrangement of the 
building blocks in the functional moiety, for example, for peptidic moieties, the amino 
acid sequence. 

The term "functional moiety" as used herein, refers to a chemical moiety 
comprising one or more building blocks. Preferably, the building blocks in the 
functional moiety are not nucleic acids. The functional moiety can be a linear or 
branched or cyclic polymer or oligomer or a small organic molecule. 

The term "building block" as used herein, is a chemical structural unit which is 
linked to other chemical structural units or can be linked to other such units. When the 
functional moiety is polymeric or oligomeric, the building blocks are the monomeric 
units of the polymer or oligomer. Building blocks can also include a scaffold structure 
("scaffold building block") to which is, or can be, attached one or more additional 
structures ("peripheral building blocks"). 

It is to be understood that the term "building block" is used herein to refer to a 
chemical structural unit as it exists in a functional moiety and also in the reactive form 
used for the synthesis of the functional moiety. Within the functional moiety, a building 
block will exist without any portion of the building block which is lost as a consequence 
of incorporating the building block into the functional moiety. For example, in cases in 
which the bond-forming reaction releases a small molecule (see below), the building 
block as it exists in the functional moiety is a "building block residue", that is, the 
remainder of the building block used in the synthesis following loss of the atoms that it 
contributes to the released molecule. 

The building blocks can be any chemical compounds which are complementary, 
that is the building blocks must be able to react together to form a structure comprising 
two or more building blocks. Typically, all of the building blocks used will have at least 
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two reactive groups, although it is possible that some of the building blocks (for example 
the last building block in an oligomeric functional moiety) used will have only one 
reactive group each. Reactive groups on two different building blocks should be 
complementary, i.e., capable of reacting together to form a covalent bond, optionally 
with the concomitant loss of a small molecule, such as water, HC1, HF, and so forth. 

For the present purposes, two reactive groups are complementary if they are 
capable of reacting together to form a covalent bond. In a preferred embodiment, the 
bond forming reactions occur rapidly under ambient conditions without substantial 
formation of side products. Preferably, a given reactive group will react with a given 
complementary reactive group exactly once. In one embodiment, complementary 
reactive groups of two building blocks react, for example, via nucleophilic substitution, 
to form a covalent bond. In one embodiment, one member of a pair of complementary 
reactive groups is an electrophilic group and the other member of the pair is a 
nucleophilic group. 

Complementary electrophilic and nucleophilic groups include any two groups 
which react via nucleophilic substitution under suitable conditions to form a covalent 
bond. A variety of suitable bond-forming reactions are known in the art. See, for 
example, March, Advanced Organic Chemistry, fourth edition, New York: John Wiley 
and Sons (1992), Chapters 10 to 16; Carey and Sundberg, Advanced Organic Chemistry, 
Part B, Plenum (1990), Chapters 1-11; and Collman et aL, Principles and Applications of 
Organotransition Metal Chemistry, University Science Books, Mill Valley, Calif. 
(1987), Chapters 13 to 20; each of which is incorporated herein by reference in its 
entirety. Examples of suitable electrophilic groups include reactive carbonyl groups, 
such as acyl chloride groups, ester groups, including carbonyl pentafluorophenyl esters 
and succinimide esters, ketone groups and aldehyde groups; reactive sulfonyl groups, 
such as sulfonyl chloride groups, and reactive phosphonyl groups. Other electrophilic 
groups include terminal epoxide groups, isocyanate groups and alkyl halide groups. 
Suitable nucleophilic groups include primary and secondary amino groups and hydroxyl 
groups and carboxyl groups. 

Suitable complementary reactive groups are set forth below. One of skill in the 
art can readily determine other reactive group pairs that can be used in the present 
method, and the examples provided herein are not intended to be limiting. 
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In a first embodiment, the complementary reactive groups include activated 
carboxyl groups, reactive sulfonyl groups or reactive phosphonyl groups, or a 
combination thereof, and primary or secondary amino groups. In this embodiment, the 
complementary reactive groups react under suitable conditions to form an amide, 
sulfonamide or phosphonamidate bond. 

In a second embodiment, the complementary reactive groups include epoxide 
groups and primary or secondary amino groups. An epoxide-containing building block 
reacts with an amine-containing building block under suitable conditions to form a 
carbon-nitrogen bond, resulting in a 6-amino alcohol. 

In another embodiment, the complementary reactive groups include aziridine 
groups and primary or secondary amino groups. Under suitable conditions, an aziridine- 
containing building block reacts with an amine-containing building block to form a 
carbon-nitrogen bond, resulting in a 1,2-diamine. In a third embodiment, the 
complementary reactive groups include isocyanate groups and primary or secondary 
amino groups. An isocyanate-containing building block will react with an amino- 
containing building block under suitable conditions to form a carbon-nitrogen bond, 
resulting in a urea group. 

In a fourth embodiment, the complementary reactive groups include isocyanate 
groups and hydroxyl groups. An isocyanate-containing building block will react with an 
hydroxyl-containing building block under suitable conditions to form a carbon-oxygen 
bond, resulting in a carbamate group. 

In a fifth embodiment, the complementary reactive groups include amino groups 
and carbonyl-containing groups, such as aldehyde or ketone groups. Amines react with 
such groups via reductive amination to form a new carbon-nitrogen bond.. 

In a sixth embodiment, the complementary reactive groups include phosphorous 
ylide groups and aldehyde or ketone groups. A phosphorus-ylide-containing building 
block will react with an aldehyde or ketone-containing building block under suitable 
conditions to form a carbon-carbon double bond, resulting in an alkene. 

In a seventh embodiment, the complementary reactive groups react via 
cycloaddition to form a cyclic structure. One example of such complementary reactive 
groups are alkynes and organic azides, which react under suitable conditions to form a 
triazole ring structure. An example of the use of this reaction to link two building blocks 
is illustrated in Figure 8. Suitable conditions for such reactions are known in the art and 
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include those disclosed in WO 03/101972, the entire contents of which are incorporated 
by reference herein. 

In an eighth embodiment, the complementary reactive groups are an alkyl halide 
and a nucleophile, such as an amino group, a hydroxyl group or a carboxyl group. Such 
groups react under suitable conditions to form a carbon-nitrogen (alkyl halide plus 
amine) or carbon oxygen (alkyl halide plus hydroxyl or carboxyl group). 

In a ninth embodiment, the complementary functional groups are a halogenated 
heteroaromatic group and a nucleophile, and the building blocks are linked under 
suitable conditions via aromatic nucleophilic substitution. Suitable halogenated 
heteroaromatic groups include chlorinated pyrimidines, triazines and purines, which 
react with nucleophiles, such as amines, under mild conditions in aqueous solution. 
Representative examples of the reaction of an oligonucleotide-tagged trichlorotriazine 
with amines are shown in Figures 9 and 10. Examples of suitable chlorinated 
heteroaromatic groups are shown in Figure 1 1 . 

It is to be understood that the synthesis of a functional moiety can proceed via 
one particular type of coupling reaction, such? as, but not limited to, one of the reactions 
discussed above, or via a combination of two or more coupling reactions, such as two or 
more of the coupling reactions discussed above. For example, in one embodiment, the 
building blocks are joined by a combination of amide bond formation (amino and 
carboxylic acid complementary groups) and reductive amination (amino and aldehyde or 
ketone complementary groups). Any coupling chemistry can be used, provided that it is 
compatible with the presence of an oligonucleotide. Double stranded (duplex) 
oligonucleotide tags, as used in certain embodiments of the present invention, are 
chemically more robust than single stranded tags, and, therefore, tolerate a broader range 
of reaction conditions and enable the use of bond- forming reactions that would not be 
possible with single-stranded tags. 

A building block can include one or more functional groups in addition to the 
reactive group or groups employed to form the functional moiety. One or more of these 
additional functional groups can be protected to prevent undesired reactions of these 
functional groups. Suitable protecting groups are known in the art for a variety of 
functional groups (Greene and Wuts, Protective Groups in Organic Synthesis , second 
edition, New York: John Wiley and Sons (1991), incorporated herein by reference). 
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Particularly useful protecting groups include t-butyl esters and ethers, acetals, trityl 
ethers and amines, acetyl esters, trimethylsilyl ethers,trichloroethyl ethers and esters and 
carbamates. 

In one embodiment, each building block comprises two reactive groups, which 
can be the same or different. For example, each building block added in cycle s can 
comprise two reactive groups which are the same, but which are both complementary to 
the reactive groups of the building blocks added at steps s-1 and s + 1. In another 
embodiment, each building block comprises two reactive groups which are themselves 
complementary. For example, a library comprising polyamide molecules can be 
produced via reactions between building blocks comprising two primary amino groups 
and building blocks comprising two activated carboxyl groups. In the resulting 
compounds there is no N- or C-terminus, as alternate amide groups have opposite 
directionality. Alternatively, a polyamide library can be produced using building blocks 
that each comprise an amino group and an activated carboxyl group. In this 
embodiment, the building blocks added in step n of the cycle will have a free reactive 
group which is complementary to the available reactive group -on the n-1 building block, 
while, preferably, the other reactive group on the nth building block is protected. For 
example, if the members of the library are synthesized from the C to N direction, the 
building blocks added will comprise an activated carboxyl group and a protected amino 
group. 

The functional moieties can be polymeric or oligomeric moieties, such as 
peptides, peptidomimetics, peptide nucleic acids or peptoids, or they can be small non- 
polymeric molecules, for example, molecules having a structure comprising a central 
scaffold and structures arranged about the periphery of the scaffold. Linear polymeric or 
oligomeric libraries will result from the use of building blocks having two reactive 
groups, while branched polymeric or oligomeric libraries will result from the use of 
building blocks having three or more reactive groups, optionally in combination with 
building blocks having only two reactive groups. Such molecules can be represented by 
the general formula X1X2. - .X n , where each X is a monomelic unit of a polymer 
comprising n monomelic units, where n is an integer greater than 1 In the case of 
oligomeric or polymeric compounds, the terminal building blocks need not comprise 
two functional groups. For example, in the case of a polyamide library, the C-terminal 
building block can comprise an amino group, but the presence of a carboxyl group is 
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optional. Similarly, the building block at the N-terminus can comprise a carboxyl group, 
but need not contain an amino group. 

Branched oligomeric or polymeric compounds can also be synthesized provided 
that at least one building block comprises three functional groups which are reactive 
with other building blocks. A library of the invention can comprise linear molecules, 
branched molecules or a combination thereof. 

Libraries can also be constructed using, for example, a scaffold building block 
having two or more reactive groups, in combination with other building blocks having 
only one available reactive group, for example, where any additional reactive groups are 
either protected or not reactive with the other reactive groups present in the scaffold 
building block. In one embodiment, for example, the molecules synthesized can be 
represented by the general formula X(Y) n , where X is a scaffold building block; each Y 
is a building block linked to X and n is an integer of at least two, and preferably an 
integer from 2 to about 6. In one preferred embodiment, the initial building block of 
cycle 1 is a scaffold building block. In molecules of the formula X(Y) n , each Y can be 
< the same or different, but in most members of a typical library, each Y will be different. 

In one embodiment, the libraries of the invention comprise polyamide 
compounds. The polyamide compounds can be composed of building blocks derived 
from any amino acids, including the twenty naturally occurring ot-amino acids, such as 
alanine (Ala; A), glycine (Gly; G), asparagine (Asn; N), aspartic acid (Asp; D), glutamic 
acid (Glu; E), histidine (His; H), leucine (Leu; L), lysine (Lys; K), phenylalanine (Phe; 
F), tyrosine (Tyr; Y), threonine (Thr; T), serine (Ser; S), arginine (Arg; R), valine (Val; 
V), glutamine (Gin; Q), isoleucine (He; I), cysteine (Cys; C), methionine (Met; M), 
proline (Pro; P) and tryptophan (Trp; W), where the three-letter and one-letter codes for 
each amino acid are given. In their naturally occurring form, each of the foregoing 
amino acids exists in the L-configuration, which is to be assumed herein unless 
otherwise noted. In the present method, however, the D-configuration forms of these 
amino acids can also be used. These D-amino acids are indicated herein by lower case 
three- or one-letter code, i.e., ala (a), gly (g), leu (1), gin (q), thr (t), ser (s), and so forth. 
The building blocks can also be derived from other oc-amino acids, including, but not 
limited to, 3-arylalanines, such as naphthylalanine, phenyl-substituted phenylalanines, 
including 4-fluoro-, 4-chloro, 4-bromo and 4-methylphenylalanine; 3-heteroarylalanines, 
such as 3-pyridylalanine, 3-thienylalanine, 3-quinolylalanine, and 3-imidazolylalanine; 
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ornithine; citrulline; homocitrulline; sarcosine; homoproline; homocysteine; substituted 
proline, such as hydroxyproline and fluoroproline; dehydroproline; norleucine; O- 
methyltyrosine; O-methylserine; O-methylthreonine and 3-cyclohexylalanine. Each of 
the preceding amino acids can be utilized in either the D- or L-configuration. 

The building blocks can also be amino acids which are not <x-amino acids, such 
as ot-azaamino acids; p, y, 5, e,-amino acids, and N-substituted amino acids, such as N- 
substituted glycine, where the N-substituent can be, for example, a substituted or 
unsubstituted alkyl, aryl, heteroaryl, arylalkyl or heteroarylalkyl group. In one 
embodiment, the N-substituent is a side chain from a naturally-occurring or non- 
naturally occurring a-amino acid. 

The building block can also be a peptidomimetic structure, such as a dipeptide, 
tripeptide, tetrapeptide or pentapeptide mimetic. Such peptidomimetic building blocks 
are preferably derived from amino acyl compounds, such that the chemistry of addition 
of these building blocks to the growing poly(aminoacyl) group is the same as, or similar 
to, the chemistry used for the other building blocks. The building blocks can also be 
molecules which are capable of forming bonds which are isosteric with a peptide bond, 
to form peptidomimetic functional moieties comprising a peptide backbone 
modification, such as [CH 2 S], \p [CH 2 NH], ^[CSNH 2 ], ^[NHCO], tA[COCH 2 ], and 
\[/[(E) or (Z) CH=CH]. In the nomenclature used above, \j/ indicates the absence of an 
amide bond. The structure that replaces the amide group is specified within the brackets. 

In one embodiment, the invention provides a method of synthesizing a 
compound comprising or consisting of a functional moiety which is operatively linked to 
an encoding oligonucleotide. The method includes the steps of: (1) providing an 
initiator compound consisting of an initial functional moiety comprising n building 
blocks, where n is an integer of 1 or greater, wherein the initial functional moiety 
comprises at least one reactive group, and wherein the initial functional moiety is 
operatively linked to an initial oligonucleotide which encodes the n building blocks; (2) 
reacting the initiator compound with a building block comprising at least one 
complementary reactive group, wherein the at least one complementary reactive group is 
complementary to the reactive group of step (1), under suitable conditions for reaction of 
the reactive group and the complementary reactive group to form a covalent bond; (3) 
reacting the initial oligonucleotide with an incoming oligonucleotide in the presence of 
an enzyme which catalyzes ligation of the initial oligonucleotide and the incoming 
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oligonucleotide, under conditions suitable for ligation of the incoming oligonucleotide 
and the initial oligonucleotide, thereby producing a molecule which comprises or 
consists of a functional moiety comprising n+1 building blocks which is operatively 
linked to an encoding oligonucleotide. If the functional moiety of step (3) comprises a 
reactive group, steps 1-3 can be repeated one or more times, thereby forming cycles 1 to 
i, where i is an integer of 2 or greater, with the product of step (3) of a cycle s-1, where s 
is an integer of i or less, becoming the initiator compound of step (1) of cycle s. In each 
cycle, one building block is added to the growing functional moiety and one 
oligonucleotide sequence, which encodes the new building block, is added to the 
growing encoding oligonucleotide. 

In a preferred embodiment, each individual building block is associated with a 
distinct oligonucleotide, such that the sequence of nucleotides in the oligonucleotide 
added in a given cycle identifies the building block added in the same cycle. 

The coupling of building blocks and ligation of oligonucleotides will generally 
occur at similar concentrations of starting materials and reagents. For example, 
concentrations of reactants on the order of micromolar to millimolar, for example from- 
about 10 jjM to about 10 mM, are preferred in order to have efficient coupling of 
building blocks. 

In certain embodiments, the method further comprises, following step (2), the 
step of scavenging any unreacted initial functional moiety. Scavenging any unreacted 
initial functional moiety in a particular cycle prevents the initial functional moiety of the 
cycle from reacting with a building block added in a later cycle. Such reactions could 
lead to the generation of functional moieties missing one or more building blocks, 
potentially leading to a range of functional moiety structures which correspond to a 
particular oligonucleotide sequence. Such scavenging can be accomplished by reacting 
any remaining initial functional moiety with a compound which reacts with the reactive 
group of step (2). Preferably, the scavenger compound reacts rapidly with the reactive 
group of step (2) and includes no additional reactive groups that can react with building 
blocks added in later cycles. For example, in the synthesis of a compound where the 
reactive group of step (2) is an amino group, a suitable scavenger compound is an N- 
hydroxysuccinimide ester, such as acetic acid N-hydroxysuccinimide ester. 

In another embodiment, the invention provides a method of producing a library 
of compounds, wherein each compound comprises a functional moiety comprising two 
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or more building block residues which is operatively linked to an oligonucleotide. In a 
preferred embodiment, the oligonucleotide present in each molecule provides sufficient 
information to identify the building blocks within the molecule and, optionally, the order 
of addition of the building blocks. In this embodiment, the method of the invention 
comprises a method of synthesizing a library of compounds, wherein the compounds 
comprise a functional moiety comprising two or more building blocks which is 
operatively linked to an oligonucleotide which identifies the structure of the functional 
moiety. The method comprises the steps of (1) providing a solution comprising m 
initiator compounds, wherein m is an integer of 1 or greater, where the initiator 
compounds consist of a functional moiety comprising n building blocks, where n is an 
integer of 1 or greater, which is operatively linked to an initial oligonucleotide which 
identifies the n building blocks; (2) dividing the solution of step (1) into at least r 
fractions, wherein r is an integer of 2 or greater; (3) reacting each fraction with one of r 
building blocks, thereby producing r fractions comprising compounds consisting of a 
functional moiety comprising n+1 building blocks operatively linked to the initial 
oligonucleotide; (4) reacting each of the r fractions of step (3) with one of a set of r 
distinct incoming oligonucleotides under conditions suitable for enzymatic ligation of 
the incoming oligonucleotide to the initial oligonucleotide, thereby producing r fractions 
comprising molecules consisting of a functional moiety comprising n+1 building blocks 
operatively linked to an elongated oligonucleotide which encodes the n+1 building 
blocks. Optionally, the method can further include the step of (5) recombining the r 
fractions, produced in step (4), thereby producing a solution comprising molecules 
consisting of a functional moiety comprising n+1 building blocks, which is operatively 
linked to an elongated oligonucleotide which encodes the n+1 building blocks. Steps 
(1) to (5) can be conducted one or more times to yield cycles 1 to i, where i is an integer 
of 2 or greater. In cycle s+1, where s is an integer of i-1 or less, the solution comprising 
m initiator compounds of step (1) is the solution of step (5) of cycle s. Likewise, the 
initiator compounds of step (1) of cycle s+1 are the products of step (4) in cycle s. 

Preferably the solution of step (2) is divided into r fractions in each cycle of the 
library synthesis. In this embodiment, each fract is reated with a unique building block. 

In the methods of the invention, the order of addition of the building block and 
the incoming oligonucleotide is not critical, and steps (2) and (3) of the synthesis of a 
molecule, and steps (3) and (4) in the library synthesis can be reversed, i.e., the 
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incoming oligonucleotide can be ligated to the initial oligonucleotide before the new 
building block is added. In certain embodiments, it may be possible to conduct these 
two steps simultaneously. 

In certain embodiments, the method further comprises, following step (2), the 
step of scavenging any unreacted initial functional moiety. Scavenging any unreacted 
initial functional moiety in a particular cycle prevents the initial functional moiety of a 
the cycle from reacting with a building block added in a later cycle. Such reactions 
could lead to the generation of functional moieties missing one or more building blocks, 
potentially leading to a range of functional moiety structures which correspond to a 
particular oligonucleotide sequence. Such scavenging can be accomplished by reacting 
any remaining initial functional moiety with a compound which reacts with the reactive 
group of step (2). Preferably, the scavenger compound reacts rapidly with the reactive 
group of step (2) and includes no additional reactive groups that can react with building 
blocks added in later cycles. For example, in the synthesis of a compound where the 
reactive group of step (2) is an amino group, a suitable scavenger compound is an N- 
hydroxysuccinimide ester, such as acetic acid N-hydroxysuccinimide ester. 

In one embodiment, the building blocks used in the library synthesis are selected 
from a set of candidate building blocks by evaluating the ability of the candidate 
building blocks to react with appropriate complementary functional groups under the 
conditions used for synthesis of the library. Building blocks which are shown to be 
suitably reactive under such conditions can then be selected for incorporation into the 
library. The products of a given cycle can, optionally, be purified. When the cycle is an 
intermediate cycle, i.e., any cycle prior to the final cycle, these products are 
intermediates and can be purified prior to initiation of the next cycle. If the cycle is the 
final cycle, the products of the cycle are the final products, and can be purified prior to 
any use of the compounds. This purification step can, for example, remove unreacted or 
excess reactants and the enzyme employed for oligonucleotide ligation. Any methods 
which are suitable for separating the products from other species present in solution can 
be used, including liquid chromatography, such as high performance liquid 
chromatography (HPLC) and precipitation with a suitable co-solvent, such as ethanol. 
Suitable methods for purification will depend upon the nature of the products and the 
solvent system used for synthesis. 
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The reactions are, preferably, conducted in aqueous solution, such as a buffered 
aqueous solution, but can also be conducted in mixed aqueous/organic media consistent 
with the solubility properties of the building blocks, the oligonucleotides, the 
intermediates and final products and the enzyme used to catalyze the oligonucleotide 
ligation. 

It is to be understood that the theoretical number of compounds produced by a 
given cycle in the method described above is the product of the number of different 
initiator compounds, m, used in the cycle and the number of distinct building blocks 
added in the cycle, r. The actual number of distinct compounds produced in the cycle 
can be as high as the product of r and m (r x m), but could be lower, given differences in 
reactivity of certain building blocks with certain other building blocks. For example, the 
kinetics of addition of a particular building block to a particular initiator compound may 
be such that on the time scale of the synthetic cycle, little to none of the product of that 
reaction may be produced. 

In certain embodiments, a common building block is added prior to cycle 1, 
following the last cycle or in between any two cycles./ For example, when the functional . 
moiety is a polyamide, a common N- terminal capping building block can be added after 
the final cycle. A common building block can also be introduced between any two 
cycles, for example, to add a functional group, such as an alkyne or azide group, which 
can be utilized to modify the functional moieties, for example by cyclization, following 
library synthesis. 

The term "operatively linked", as used herein, means that two chemical 
structures are linked together in such a way as to remain linked through the various 
manipulations they are expected to undergo. Typically the functional moiety and the 
encoding oligonucleotide are linked covalently via an appropriate linking group. The 
linking group is a bivalent moiety with a site of attachment for the oligonucleotide and a 
site of attachment for the functional moiety. For example, when the functional moiety is 
a polyamide compound, the polyamide compound can be attached to the linking group at 
its N-terminus, its C-terminus or via a functional group on one of the side chains. The 
linking group is sufficient to separate the polyamide compound and the oligonucleotide 
by at least one atom, and preferably, by more than one atom, such as at least two, at least 
three, at least four, at least five or at least six atoms. Preferably, the linking group is 
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sufficiently flexible to allow the polyamide compound to bind target molecules in a 
manner which is independent of the oligonucleotide. 

In one embodiment, the linking group is attached to the N-terminus of the 
polyamide compound and the 5 '-phosphate group of the oligonucleotide. For example, 
the linking group can be derived from a linking group precursor comprising an activated 
carboxyl group on one end and an activated ester on the other end. Reaction of the 
linking group precursor with the N-terminal nitrogen atom will form an amide bond 
connecting the linking group to the polyamide compound or N-terminal building block, 
while reaction of the linking group precursor with the 5 '-hydroxy group of the 
oligonucleotide will result in attachment of the oligonucleotide to the linking group via 
an ester linkage. The linking group can comprise, for example, a polymethylene chain, 
such as a -(CH 2 ) n - chain or a poly(ethylene glycol) chain, such as a -(CH 2 CH 2 0) n chain, 
where in both cases n is an integer from 1 to about 20. Preferably, n is from 2 to about 
12, more preferably from about 4 to about 10. In one embodiment, the linking group 
comprises a hexamethylene (-(CH2)6-) group. 

When the building blocks are amino acid residues,' the resulting functional 
moiety is a polyamide. The amino acids can be coupled using any suitable chemistry for 
the formation of amide bonds. Preferably, the coupling of the amino acid building 
blocks is conducted under conditions which are compatible with enzymatic ligation of 
oligonucleotides, for example, at neutral or near-neutral pH and in aqueous solution. In 
one embodiment, the polyamide compound is synthesized from the C-terminal to N- 
terminal direction. In this embodiment, the first, or C-terminal, building block is 
coupled at its carboxyl group to an oligonucleotide via a suitable linking group. The 
first building block is reacted with the second building block, which preferably has an 
activated carboxyl group and a protected amino group. Any activating/protecting group 
strategy which is suitable for solution phase amide bond formation can be used. For 
example, suitable activated carboxyl species include acyl fluorides (U.S. Patent No. 
5,360,928, incorporated herein by reference in its entirety), symmetrical anhydrides and 
N-hydroxysuccinimide esters. The acyl groups can also be activated in situ, as is known 
in the art, by reaction with a suitable activating compound. Suitable activating 
compounds include dicyclohexylcarbodiimide (DCC), diisopropylcarbodiimide (DIC), 
l-ethoxycarbonyl-2-ethoxy-l,2-dihydroquinoline (EEDQ), l-ethyl-3-(3- 
dimethylaminopropyl)carbodiimide hydrochloride (EDC), n-propane-phosphonic 
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anhydride (PPA), N,N-bis (2-oxo-3-oxazolidinyl)imido-phosphoryl chloride (BOP-C1), 
bromo-tris-pyrrolidinophosphonium hexafluorophosphate (PyBrop), diphenylphosphoryl 
azide (DPP A), Castro's reagent (BOP, PyBop), 0-benzotriazolyl-N,N,N\ N'- 
tetramethyluronium salts (HBTU), diethylphosphoryl cyanide (DEPCN), 2,5-diphenyl- 
2,3-dihydro-3-oxo-4-hydroxy-thiophene dioxide (Steglich's reagent; HOTDO), 1,1- 
carbonyl-diimidazole (CDI), and 4-(4,6-dimethoxy-l,3,5-triazin-2-yl)-4- 
methylmorpholinium chloride (DMT-MM). The coupling reagents can be employed 
alone or in combination with additives such as N. N-dimethyl-4-aminopyridine 
(DMAP), N-hydroxy-benzotriazole (HOBt), N-hydroxybenzotriazine (HOOBt), N- 
hydroxysuccinimide (HOSu) N-hydroxyazabenzotriazole (HO At), azabenzotriazolyl- 
tetramethyluronium salts (HATU, HAPyU) or 2-hydroxypyridine. In certain 
embodiments, synthesis of a library requires the use of two or more activation strategies, 
to enable the use of a structurally diverse set of building blocks. For each building block, 
one skilled in the art can determine the appropriate activation strategy. 

The N-terminal protecting group can be any protecting group which is 
i compatible with the conditions of the process, for example; protecting groups which are : = 
suitable for solution phase synthesis conditions. A preferred protecting group is the 
fluorenylmethoxycarbonyl ("Fmoc") group. Any potentially reactive functional groups 
on the side chain of the aminoacyl building block may also need to be suitably protected. 
Preferably the side chain protecting group is orthogonal to the N-terminal protecting 
group, that is, the side chain protecting group is removed under conditions which are 
different than those required for removal of the N-terminal protecting group. Suitable 
side chain protecting groups include the nitroveratryl group, which can be used to 
protect both side chain carboxyl groups and side chain amino groups. Another suitable 
side chain amine protecting group is the N-pent-4-enoyl group. 

The building blocks can be modified following incorporation into the functional 
moiety, for example, by a suitable reaction involving a functional group on one or more 
of the building blocks. Building block modification can take place following addition of 
the final building block or at any intermediate point in the synthesis of the functional 
moiety, for example, after any cycle of the synthetic process. When a library of 
bifunctional molecules of the invention is synthesized, building block modification can 
be carried out on the entire library or on a portion of the library, thereby increasing the 
degree of complexity of the library. Suitable building block modifying reactions include 
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those reactions that can be performed under conditions compatible with the functional 
moiety and the encoding oligonucleotide. Examples of such reactions include acylation 
and sulfonation of amino groups or hydroxyl groups, alkylation of amino groups, 
esterification or thioesterification of carboxyl groups, amidation of carboxyl groups, 
epoxidation of alkenes, and other reactions as are known the art. When the functional 
moiety includes a building block having an alkyne or an azide functional group, the 
azide/alkyne cycloaddition reaction can be used to derivatize the building block. For 
example, a building block including an alkyne can be reacted with an organic azide, or a 
building block including an azide can be reacted with an alkyne, in either case forming a 
triazole. Building block modification reactions can take place after addition of the final 
building block or at an intermediate point in the synthetic process, and can be used to 
append a variety of chemical structures to the functional moiety, including 
carbohydrates, metal binding moieties and structures for targeting certain biomolecules 
or tissue types. 

In another embodiment, the functional moiety comprises a linear series of 
c. building blocks and this linear series is cyclized using' a suitable reaction. :For example* : 
if at least two building blocks in the linear array include sulfhydryl groups, the 
sulfhydryl groups can be oxidized to form a disulfide linkage, thereby cyclizing the 
linear array. For example, the functional moieties can be oligopeptides which include 
two or more L or D-cysteine and/or L or D-homocysteine moieties. The building blocks 
can also include other functional groups capable of reacting together to cyclize the linear 
array, such as carboxyl groups and amino or hydroxyl groups. 

In a preferred embodiment, one of the building blocks in the linear array 
comprises an alkyne group and another building block in the linear array comprises an 
azide group. The azide and alkyne groups can be induced to react via cycloaddition, 
resulting in the formation of a macrocyclic structure. In the example illustrated in 
Figure 9, the functional moiety is a polypeptide comprising a propargylglycine building 
block at its C-terminus and an azidoacetyl group at its N-terminus. Reaction of the 
alkyne and the azide group under suitable conditions results in formation of a cyclic 
compound, which includes a triazole structure within the macrocycle. In the case of a 
library, in one embodiment, each member of the library comprises alkyne- and azide- 
containing building blocks and can be cyclized in this way. In a second embodiment, all 
members of the library comprises alkyne- and azide-containing building blocks, but only 
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a portion of the library is cyclized. In a third embodiment, only certain functional 
moieties include alkyne- and azide-containing building blocks, and only these molecules 
are cyclized. In the forgoing second and third embodiments, the library, following the 
cycloaddition reaction, will include both cyclic and linear functional moieties. 

The oligonucleotides are ligated using enzymatic methods. In one embodiment, 
the initial building block is operatively linked to an initial oligonucleotide. Prior to or 
following coupling of a second building block to the initial building block, a second 
oligonucleotide sequence which identifies the second building block is ligated to the 
initial oligonucleotide. Methods for ligating the initial oligonucleotide sequence and the 
incoming oligonucleotide sequence are set forth in Figures 1 and 2. In Figure 1, the 
initial oligonucleotide is double-stranded, and one strand includes an overhang sequence 
which is complementary to one end of the second oligonucleotide and brings the second 
oligonucleotide into contact with the initial oligonucleotide. Preferably the overhanging 
sequence of the initial oligonucleotide and the complementary sequence of the second 
oligonucleotide are both at least about 4 bases; more preferably both sequences are both 
the same length. . The initial oligonucleotide and the second oligonucleotide can be 
ligated using a suitable enzyme. If the initial oligonucleotide is linked to the first 
building block at the 5' end of one of the strands (the "top strand"), then the strand 
which is complementary to the top strand (the "bottom strand") will include the 
overhang sequence at its 5' end, and the second oligonucleotide will include a 
complementary sequence at its 5'end. Following ligation of the second oligonucleotide, 
a strand can be added which is complementary to the sequence of the second 
oligonucleotide which is 3' to the overhang complementary sequence, and which 
includes additional overhang sequence. 

In one embodiment, the oligonucleotide is elongated as set forth in Figure 2. The 
oligonucleotide bound to the growing functional moiety and the incoming 
oligonucleotide are positioned for ligation by the use of a "splint" sequence, which 
includes a region which is complementary to the 3' end of the initial oligonucleotide and 
a region which is complementary to the 5' end of the incoming oligonucleotide. The 
splint brings the 5' end of the oligonucleotide into proximity with the 3' end of the 
incoming oligo and ligation is accomplished using enzymatic ligation. In the example 
illustrated in Figure 2, the initial oligonucleotide consists of 16 nucleobases and the 
splint is complementary to the 6 bases at the 3' end. The incoming oligonucleotide 
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consists of 12 nucleobases, and the splint is complementary to the 6 bases at the 5' 
terminus. The length of the splint and the lengths of the complementary regions are not 
critical. However, the complementary regions should be sufficiently long to enable 
stable dimer formation under the conditions of the ligation, but not so long as to yield an 
excessively large encoding nucleotide in the final molecules. It is preferred that the 
complementary regions are from about 4 bases to about 12 bases, more preferably from 
about 5 bases to about 10 bases, and most preferably from about 5 bases to about 8 bases 
in length. 

In one embodiment, the initial oligonucleotide is double-stranded and the two 
strands are covalently joined. One means of covalently joining the two strands is shown 
in Figure 3, in which a linking moiety is used to link the two strands and the functional 
moiety. The linking moiety can be any chemical structure which comprises a first 
functional group which is adapted to react with a building block, a second functional 
group which is adapted to react with the 3' -end of an oligonucleotide, and a third 
functional group which is adapted to react with the 5 '-end of an oligonucleotide. 
Preferably, the second and third functional groups are oriented so as to position the two 
oligonucleotide strands in a relative orientation that permits hybridization of the two 
strands. For example, the linking moiety can have the general structure (I): 




E 



where A, is a functional group that can form a covalent bond with a building block, B is 
a functional group that can form a bond with the 5 '-end of an oligonucleotide, and C is a 
functional group that can form a bond with the 3 '-end of an oligonucleotide. D, F and E 
are chemical groups that link functional groups A, C and B toS, which is a core atom or 
scaffold. Preferably, D, E and F are each independently a chain of atoms, such as an 
alkylene chain or an oligo(ethylene glycol) chain, and D, E and F can be the same or 
different, and are preferably effective to allow hybridization of the two oligonucleotides 
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and synthesis of the functional moiety. In one embodiment, the trivalent linker has the 
structure 




In this embodiment, the NH group is available for attachment to a building block, while 
the terminal phosphate groups are available for attachment to an oligonucleotide. 

In embodiments in which the initial oligonucleotide is double-stranded, the 
incoming oligonucleotides are also double-stranded. As shown in Figure 3 ? the initial 
'oligonucleotide can 'have one strand which is longer than the other, providing an 
overhang sequence. In this embodiment, the incoming oligonucleotide includes an 
overhang sequence which is complementary to the overhang sequence of the initial 
oligonucleotide. Hybridization of the two complementary overhang sequences brings 
the incoming oligonucleotide into position for ligation to the initial oligonucleotide. 
This ligation can be performed enzymatically using a DNA or RNA ligase. The 
overhang sequences of the incoming oligonucleotide and the initial oligonucleotide are 
preferably the same length and consist of two or more nucleotides, preferably from 2 to 
about 10 nucleotides, more preferably from 2 to about 6 nucleotides. In one preferred 
embodiment, the incoming oligonucleotide is a double-stranded oligonucleotide having 
an overhang sequence at each end. The overhang sequence at one end is complementary 
to the overhang sequence of the initial oligonucleotide, while, after ligation of the 
incoming oligonucleotide and the initial oligonucleotide, the overhang sequence at the 
other end becomes the overhang sequence of initial oligonucleotide of the next cycle. In 
one embodiment, the three overhang sequences are all 2 to 6 nucleotides in length, and 
the encoding sequence of the incoming oligonucleotide is from 3 to 10 nucleotides in 
length, preferably 3 to 6 nucleotides in length. In a particular embodiment, the overhang 
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sequences are all 2 nucleotides in length and the encoding sequence is 5 nucleotides in 
length. 

In the embodiment illustrated in Figure 4, the incoming strand has a region at its 
3' end which is complementary to the 3' end of the initial oligonucleotide, leaving 
overhangs at the 5' ends of both strands. The 5' ends can be filled in using, for example, 
a DNA polymerase, such as vent polymerase, resulting in a double-stranded elongated 
oligonucleotide. The bottom strand of this oligonucleotide can be removed, and 
additional sequence added to the 3' end of the top strand using the same method. 

The encoding oligonucleotide tag is formed as the result of the successive 
addition of oligonucleotides that identify each successive building block. In one 
embodiment of the methods of the invention, the successive oligonucleotide tags may be 
coupled by enzymatic ligation to produce an encoding oligonucleotide. 

Enzyme-catalyzed ligation of oligonucleotides can be performed using any 
enzyme that has the ability to ligate nucleic acid fragments. Exemplary enzymes include 
ligases, polymerases, and topqispmerases.. In specific embodiments of the invention, 
PNA ligase (EG 6.5 .1,1), DNA;pDlymerase ^G-2 5 7 ? 7 : 7),JINA polymerase (EC 2.7.7.6) 
or topoisomerase (EC 5.99.1.2) are used to ligate the .oligonucleotides. Enzymes 
contained in each EC class can be found, for example, as described in Bairoch (2000) 
Nucleic Acids Research 28:304-5. 

In a preferred embodiment, the oligonucleotides used in the methods of the 
invention are oligodeoxynucleotides and the enzyme used to catalyze the 
oligonucleotide ligation is DNA ligase. In order for ligation to occur in the presence of 
the ligase, i.e., for a phosphodiester bond to be formed between two oligonucleotides, 
one oligonucleotide must have a free 5' phosphate group and the other oligonucleotide 
must have a free 3' hydroxyl group. Exemplary DNA ligases that may be used in the 
methods of the invention include T4 DNA ligase, Taq DNA ligase, T 4 RNA ligase, DNA 
ligase (E. coli) (all available from, for example, New England Biolabs, MA). 

One of skill in the art will understand that each enzyme used for ligation has 
optimal activity under specific conditions, e.g., temperature, buffer concentration, pH 
and time. Each of these conditions can be adjusted, for example, according to the 
manufacturer's instructions, to obtain optimal ligation of the oligonucleotide tags. 

The incoming oligonucleotide can be of any desirable length, but is preferably at 
least three nucleobases in length. More preferably, the incoming oligonucleotide is 4 or 
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more nucleobases in length. In one embodiment, the incoming oligonucleotide is from 3 
to about 12 nucleobases in length. It is preferred that the oligonucleotides of the 
molecules in the libraries of the invention have a common terminal sequence which can 
serve as a primer for PCR, as is known in the art. Such a common terminal sequence 
can be incorporated as the terminal end of the incoming oligonucleotide added in the 
final cycle of the library synthesis, or it can be added following library synthesis, for 
example, using the enzymatic ligation methods disclosed herein. 

A preferred embodiment of the method of the invention is set forth in Figure 5. 
The process begins with a synthesized DNA sequence which is attached at its 5' end to a 
linker which terminates in an amino group. In step 1, this starting DNA sequence is 
ligated to an incoming DNA sequence in the presence of a splint DNA strand, DNA 
ligase and dithiothreitol in Tris buffer. This yields a tagged DNA sequence which can 
then be used directly in the next step or purified, for example, using HPLC or ethanol 
precipitation, before proceeding to the next step. In step 2 the tagged DNA is reacted 
with a protected activated amino acid, in this example, an Fmoc-protected amino acid 
fluoride, yielding a protected amino acid-DNA conjugate. In step 3, the protected amino 
acid-DNA conjugate is deprotected, for example, in the presence of piperidine, and the 
resulting deprotected conjugate is, optionally, purified, for example, by HPLC or ethanol 
precipitation. The deprotected conjugate is the product of the first synthesis cycle, and 
becomes the starting material for the second cycle, which adds a second amino acid 
residue to the free amino group of the deprotected conjugate. 

In embodiments in which PCR is to be used to amplify the encoding 
oligonucleotides of selected molecules, the encoding oligonucleotides preferably include 
PCR primer sequences. For example, a PCR primer sequence can be included in the 
initial oligonucleotide prior to the first cycle of synthesis, or it can be included with the 
first incoming oligonucleotide. The encoding oligonucleotide can also include a capping 
PCR primer sequence that follows the encoding sequences. The capping sequence can 
be ligated to the encoding oligonucleotide following the final cycle of library synthesis 
or it can be included in the incoming oligonucleotide of the final cycle. In cases in 
which the PCR primer sequences are included in an incoming oligonucleotide, these 
incoming oligonucleotides will preferably be significantly longer than the incoming 
oligonucleotides added in the other cycles, because they will include both an encoding 
sequence and a PCR primer sequence. 
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In cases in which the capping sequence is added after the addition of the final 
building block and final incoming oligonucleotide, the synthesis of a library as set forth 
herein will include the step of ligating the capping sequence to the encoding 
oligonucleotide, such that the oligonucleotide portion of substantially all of the library 
members terminates in a sequence that includes a PCR primer sequence. PCR primer 
sequences suitable for use in the libraries of the invention are known in the art; suitable 
primers and methods are set forth, for example, in Innis et al., eds., PCR Protocols: A 
Guide to Methods and Applications, San Diego: Academic Press (1990), the contents of 
which are incorporated herein by reference in their entirety. Preferably, the capping 
sequence is added by ligation to the pooled fractions which are products of the final 
synthetic cycle. The capping sequence can be added using the enzymatic process used 
in the construction of the library. 

As indicated above, the nucleotide sequence of the oligonucleotide tag as part of 
the methods of this invention, may be determined by the use of the polymerase chain 
reaction (PCR). 

The oligonucleotide tagas comprised of ^polynucleotides that identify the building 
blocks that make up the functional moiety as described herein. The nucleic acid 
sequence of the oligonucleotide tag is determined by subjecting the oligonucleotide tag 
to a PCR reaction as follows. The appropriate sample is contacted with a PCR primer 
pair, each member of the pair having a preselected nucleotide sequence. The PCR primer 
pair is capable of initiating primer extension reactions by hybridizing to a PCR primer 
binding site on the encoding oligonucleotide tag. The PCR primer binding site is 
preferably designed into the encoding oligonucleotide tag. For example, a PCR primer 
binding site may be incorporated into the initial oligonucleotide tag and the second PCR 
primer binding site may be in the final oligonucleotide tag. Alternatively, the second 
PCR primer binding site may be incorporated into the capping sequence as described 
herein. In preferred embodiments, the PCR primer binding site is at least about 5, 7, 10, 
13, 15, 17, 20, 22, or 25 nucleotides in length. 

The PCR reaction is performed by mixing the PCR primer pair, preferably a 
predetermined amount thereof, with the nucleic acids of the encoding oligonucleotide 
tag, preferably a predetermined amount thereof, in a PCR buffer to form a PCR reaction 
admixture. The admixture is thermocycled for a number of cycles, which is typically 
predetermined, sufficient for the formation of a PCR reaction product. A sufficient 
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amount of product is one that can be isolated in a sufficient amount to allow for DNA 
sequence determination. 

PCR is typically carried out by thermocycling i.e., repeatedly increasing and 
decreasing the temperature of a PCR reaction admixture within a temperature range 
whose lower limit is about 30 °C to about 55 °C and whose upper limit is about 90 °C to 
about 100 °C. The increasing and decreasing can be continuous, but is preferably phasic 
with time periods of relative temperature stability at each of temperatures favoring 
polynucleotide synthesis, denaturation and hybridization. 

The PCR reaction is performed using any suitable method. Generally it occurs in 
a buffered aqueous solution, i.e., a PCR buffer, preferably at a pH of 7-9. Preferably, a 
molar excess of the primer is present. A large molar excess is preferred to improve the 
efficiency of the process. 

The PCR buffer also contains the deoxyribonucleotide triphosphates 
(polynucleotide synthesis substrates) dATP, dCTP, dGTP, and dTTP and a polymerase, 
typically thermostable, all in adequate amounts for primer extension (polynucleotide 
synthesis) reaction. The resulting soluiion XPCR admixture) is heated to about 90° C- 
100° C for about 1 to 10 minutes, preferably from 1 to 4 minutes. After this heating 
period the solution is allowed to cool to 54° C, which is preferable for primer 
hybridization. The synthesis reaction may occur at a temperature ranging from room 
temperature up to a temperature above which the polymerase (inducing agent) no longer 
functions efficiently. Thus, for example, if DNA polymerase is used, the temperature is 
generally no greater than about 40° C. The thermocycling is repeated until the desired 
amount of PCR product is produced. An exemplary PCR buffer comprises the following 
reagents: 50 mM KC1; 10 mM Tris-HCl at pH 8.3; 1.5 mM MgCl.sub.2 ; 0.001% 
(wt/vol) gelatin, 200 |xM dATP; 200 |aM dTTP; 200 jliM dCTP; 200 ^iM dGTP; and 2.5 
units Thermus aquaticus (Taq) DNA polymerase I per 100 microliters of buffer. 

Suitable enzymes for elongating the primer sequences include, for example, E. 
coli DNA polymerase I, Taq DNA polymerase, Klenow fragment of E. coli DNA 
polymerase I, T4 DNA polymerase, other available DNA polymerases, reverse 
transcriptase, and other enzymes, including heat-stable enzymes, which will facilitate 
combination of the nucleotides in the proper manner to form the primer extension 
products which are complementary to each nucleic acid strand. Generally, the synthesis 
will be initiated at the 3 f end of each primer and proceed in the 5' direction along the 
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template strand, until synthesis terminates, producing molecules of different lengths. 

The newly synthesized DNA strand and its complementary strand form a double- 
stranded molecule which can be used in the succeeding steps of the analysis process. 

PCR amplification methods are described in detail in U.S. Patent Nos. 4,683,192, 
4,683,202, 4,800,159, and 4,965,188, and at least in PCR Technology: Principles and 
Applications for DNA Amplification, H. Erlich, ed., Stockton Press, New York (1989); 
and PCR Protocols: A Guide to Methods and Applications, Innis et aL, eds., Academic 
Press, San Diego, Calif. (1990). The contents of all the foregoing documents are 
incorporated herein by reference. 

The term "polynucleotide" as used herein in reference to primers, probes and 
nucleic acid fragments or segments to be synthesized by primer extension is defined as a 
molecule comprised of two or more deoxyribonucleotides, preferably more than three. 

The term "primer" as used herein refers to a polynucleotide whether purified 
from a nucleic acid restriction digest or produced synthetically, which is capable of 
acting as a point of initiation of nucleic acid synthesis when placed under conditions in 
- iwhich synthesis of a. primer ex tension? product which is complementary to a nucleic acid 
strand is induced, i.e., in the presence of nucleotides and an agent for polymerization 
such as DNA polymerase, reverse transcriptase and the like, and at a suitable 
temperature and pH. The primer is preferably single stranded for maximum efficiency, 
but may alternatively be in double stranded form. If double stranded, the primer is first 
treated to separate it from its complementary strand before being used to prepare 
extension products. Preferably, the primer is a polydeoxyribonucleotide. The primer 
must be sufficiently long to prime the synthesis of extension products in the presence of 
the agents for polymerization. The exact lengths of the primers will depend on many 
factors, including temperature and the source of primer. 

The primers used herein are selected to be "substantially" complementary to the 
different strands of each specific sequence to be amplified. This means that the primer 
must be sufficiently complementary so as to non-randomly hybridize with its respective 
template strand. Therefore, the primer sequence may or may not reflect the exact 
sequence of the template. 

The polynucleotide primers can be prepared using any suitable method, such as, 
for example, the phosphotriester or phosphodiester methods described in Narang et aL, 
(1979) Meth. EnzymoL, 68:90; U.S. Pat. No. 4,356,270, U.S. Pat. No. 4,458,066, U.S. 
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Pat. No. 4,416,988, U.S. Pat. No. 4,293,652; and Brown et al 9 (1979) Meth. EnzymoL, 
68:109. The contents of all the foregoing documents are incorporated herein by 
reference. 

Once the encoding oligonucleotide tag has been amplified, the sequence of the 
tag, and ultimately the composition of the selected molecule, can be determined using 
nucleic acid sequence analysis, a well known procedure for determining the sequence of 
nucleotide sequences. Nucleic acid sequence analysis is approached by a combination 
of (a) physiochemical techniques, based on the hybridization or denaturation of a probe 
strand plus its complementary target, and (b) enzymatic reactions with polymerases. 

The invention further relates to the compounds which can be produced using the 
methods of the invention, and collections of such compounds, either as isolated species 
or pooled to form a library of chemical structures. Compounds of the invention include 
compounds of the formula 

Vs. 



where X is a functional moiety comprising one or more building blocks, Z is an 
oligonucleotide attached at its 3' terminus to B and Y is an oligonucleotide which is 
attached to C at its 5' terminus. A is a functional group that forms a covalent bond with 
X, B is a functional group that forms a bond with the 3 '-end of Z and C is a functional 
group that forms a bond with the 5 '-end of Y. D, F and E are chemical groups that link 
functional groups A, C and B to S, which is a core atom or scaffold. Preferably, D, E 
and F are each independently a chain of atoms, such as an alkylene chain or an 
oligo(ethylene glycol) chain, and D, E and F can be the same or different, and are 
preferably effective to allow hybridization of the two oligonucleotides and synthesis of 
the functional moiety. 

Preferably, Y and Z are substantially complementary and are oriented in the 
compound so as to enable Watson-Crick base pairing and duplex formation under 
suitable conditions. Y and Z are the same length or different lengths. Preferably, Y and 
Z are the same length, or one of Y and Z is from 1 to 10 bases longer than the other. In a 
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preferred embodiment, Y and Z are each 10 or more bases in length and have 
complementary regions of ten or more base pairs. More preferably, Y and Z are 
substantially complementary throughout their length, i.e., they have no more than one 
mismatch per every ten base pairs. Most preferably, Y and Z are complementary 
throughout their length, i.e., except for any overhang region on Y or Z, the strands 
hybridize via Watson-Crick base pairing with no mismatches throughout their entire 
length. 

S can be a single atom or a molecular scaffold. For example, S can be a carbon 
atom, a boron atom, a nitrogen atom or a phosphorus atom, or a polyatomic scaffold, 
such as a phosphate group or a cyclic group, such as a cycloalkyl, cycloalkenyl, 
heterocycloalkyl, heterocycloalkenyl, aryl or heteroaryl group. In one embodiment, the 
linker is a group of the structure 



N (CH 2 )n 




OP(0) 2 0- (CH 2 CH 2 0) m OFO 3 - 

OP(0) 2 0- (CH 2 CH 2 0)p — -OPO 3 



where each of n, m and p is, independently, an integer from 1 to about 20, preferably 
from 2 to eight, and more preferably from 3 to 6. In one particular embodiment, the 
linker has the structure shown below. 



HN 




In one embodiment, the libraries of the invention include molecules consisting of 
a functional moiety composed of building blocks, where each functional moiety is 
operatively linked to an encoding oligonucleotide. The nucleotide sequence of the 
encoding oligonucleotide is indicative of the building blocks present in the functional 
moiety, and in some embodiments, the connectivity or arrangement of the building 
blocks. The invention provides the advantage that the methodology used to construct 
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the functional moiety and that used to construct the oligonucleotide tag can be 
performed in the same reaction medium, preferably an aqueous medium, thus 
simplifying the method of preparing the library compared to methods in the prior art. In 
certain embodiments in which the oligonucleotide ligation steps and the building block 
addition steps can both be conducted in aqueous media, each reaction will have a 
different pH optimum. In these embodiments, the building block addition reaction can 
be conducted at a suitable pH and temperature in a suitable aqueous buffer. The buffer 
can then be exchanged for an aqueous buffer which provides a suitable pH for 
oligonucleotide ligation. 

One advantage of the methods of the invention is that they can be used to prepare 
libraries comprising vast numbers of compounds. The ability to amplify encoding 
oligonucleotide sequences using known methods such as polymerase chain reaction 
("PCR") means that selected molecules can be identified even if relatively few copies 
are recovered. This allows the practical use of very large libraries, which, as a 
consequence of their high degree of complexity, either comprise relatively few copies of 
any given library member, or require the use of very large volumes. For example, a 
library consisting of 10 8 unique structures in which each structure has 1 x 10 12 copies 
(about 1 picomole), requires about 100 L of solution at 1 |uM effective concentration. 
For the same library, if each member is represented by 1,000,000 copies, the volume 
required is 100 |aL at 1 |iM effective concentration. 

In a preferred embodiment, the library comprises from about 10 3 to about 10 15 
copies of each library member. Given differences in efficiency of synthesis among the 
library members, it is possible that different library members will have different 
numbers of copies in any given library. Therefore, although the number of copies of 
each member theoretically present in the library may be the same, the actual number of 
copies of any given library member is independent of the number of copies of any other 
member. More preferably, the compound libraries of the invention include at least about 
10 5 , 10 6 or 10 7 copies of each library member, or of substantially all library members. 
By "substantially all" library members is meant at least about 85% of the members of 
the library, preferably at least about 90%, and more preferably at least about 95% of the 
members of the library. 

Preferably, the library includes a sufficient number of copies of each member 
that multiple rounds (i.e., two or more) of selection against a biological target can be 
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performed, with sufficient quantities of binding molecules remaining following the final 
round of selection to enable amplification of the oligonucleotide tags of the remaining 
molecules and, therefore, identification of the functional moieties of the binding 
molecules. A schematic representation of such a selection process is illustrated in 
Figure 6, in which 1 and 2 represent library members, B is a target molecule and X is a 
moiety operatively linked to B that enables the removal of B from the selection medium. 
In this example, compound 1 binds to B, while compound 2 does not bind to B. The 
selection process, as depicted in Round 1, comprises (I) contacting a library comprising 
compounds 1 and 2 with B-X under conditions suitable for binding of compound 1 to B; 
(II) removing unbound compound 2, (III) dissociating compound 1 from B and 
removing BX from the reaction medium. The result of Round 1 is a collection of 
molecules that is enriched in compound 1 relative to compound 2. Subsequent rounds 
employing steps I-III result in further enrichment of compound 1 relative to compound 
2. Although three rounds of selection are shown in Figure 6, in practice any number of 
rounds may be employed, for example from one round to ten rounds, to achieve the 
desired enrichment of binding molecules relative to nonrbinding molecules . 

In the embodiment shown in Figure 6, there is no amplification (synthesis of 
more copies) of the compounds remaining after any of the rounds of selection. Such 
amplification can lead to a mixture of compounds which is not consistent with the 
relative amounts of the compounds remaining after the selection. This inconsistency is 
due to the fact that certain compounds may be more readily synthesized that other 
compounds, and thus may be amplified in a manner which is not proportional to their 
presence following selection. For example, if compound 2 is more readily synthesized 
than compound 1 , the amplification of the molecules remaining after Round 2 would 
result in a disproportionate amplification of compound 2 relative to compound 1, and a 
resulting mixture of compounds with a much lower (if any) enrichment of compound 1 
relative to compound 2. 

In one embodiment, the target is immobilized on a solid support by any known 
immobilization technique. The solid support can be, for example, a water-insoluble 
matrix contained within a chromatography column or a membrane. The encoded library 
can be applied to a water-insoluble matrix contained within a chromatography column. 
The column is then washed to remove non-specific binders. Target-bound compounds 
can then be dissociated by changing the pH, salt concentration, organic solvent 
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concentration, or other methods, such as competition with a known ligand to the target. 

In another embodiment, the target is free in solution and is incubated with the 
encoded library. Compounds which bind to the target (also referred to herein as 
"ligands") are selectively isolated by a size separation step such as gel filtration or 
ultrafiltration. In one embodiment, the mixture of encoded compounds and the target 
biomolecule are passed through a size exclusion chromatography column (gel filtration), 
which separates any ligand-target complexes from the unbound compounds. The ligand- 
target complexes are transferred to a reverse-phase chromatography column, which 
dissociates the ligands from the target. The dissociated ligands are then analyzed by 
PCR amplification and sequence analysis of the encoding oligonucleotides. This 
approach is particularly advantageous in situations where immobilization of the target 
may result in a loss of activity. 

Once single ligands are identified by the above-described process, various levels 
of analysis can be applied to yield structure-activity relationship information and to 
guide further optimization of the affinity, specificity and bioactivity of the ligand. For 
•ligands derived from the same scaffold, three-dimensional molecular modeling can be 
employed to identify significant structural features common to the ligands, thereby 
generating families of small-molecule ligands that presumably bind at a common site on 
the target biomolecule. 

A variety of screening approaches can be used to obtain ligands that possess 
high affinity for one target but significantly weaker affinity for another closely related 
target. One screening strategy is to identify ligands for both biomolecules in parallel 
experiments and to subsequently eliminate common ligands by a cross-referencing 
comparison. In this method, ligands for each biomolecule can be separately identified as 
disclosed above. This method is compatible with both immobilized target biomolecules 
and target biomolecules free in solution. 

For immobilized target biomolecules, another strategy is to add a preselection 
step that eliminates all ligands that bind to the non-target biomolecule from the library. 
For example, a first biomolecule can be contacted with an encoded library as described 
above. Compounds which do not bind to the first biomolecule are then separated from 
any first biomolecule-ligand complexes which form. The second biomolecule is then 
contacted with the compounds which did not bind to the first biomolecule. Compounds 
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which bind to the second biomolecule can be identified as described above and have 
significantly greater affinity for the second biomolecule than to the first biomolecule. 

A ligand for a biomolecule of unknown function which is identified by the 
method disclosed above can also be used to determine the biological function of the 
biomolecule. This is advantageous because although new gene sequences continue to be 
identified, the functions of the proteins encoded by these sequences and the validity of 
these proteins as targets for new drug discovery and development are difficult to 
determine and represent perhaps the most significant obstacle to applying genomic 
information to the treatment of disease. Target-specific ligands obtained through the 
process described in this invention can be effectively employed in whole cell biological 
assays or in appropriate animal models to understand both the function of the target 
protein and the validity of the target protein for therapeutic intervention. This approach 
can also confirm that the target is specifically amenable to small molecule drug 
discovery. 

In one embodiment, one or more compounds within a library of the invention 
are identified as ligands for a particular biomolecule . These compounds can then be: ; ; 
assessed in an in vitro assay for the ability to bind to the biomolecule. Preferably, the 
functional moieties of the binding compounds are synthesized without the 
oligonucleotide tag or linker moiety, and these functional moieties are assessed for the 
ability to bind to the biomolecule. 

The effect of the binding of the functional moieties to the biomolecule on the 
function of the biomolecule can also be assessed using in vitro cell-free or cell-based 
assays. For a biomolecule having a known function, the assay can include a comparison 
of the activity of the biomolecule in the presence and absence of the ligand, for example, 
by direct measurement of the activity, such as enzymatic activity, or by an indirect 
measure, such as a cellular function that is influenced by the biomolecule. If the 
biomolecule is of unknown function, a cell which expresses the biomolecule can be 
contacted with the ligand and the effect of the ligand on the viability, function, 
phenotype, and/or gene expressionof the cell is assessed. The in vitro assay can be, for 
example, a cell death assay, a cell proliferation assay or a viral replication assay. For 
example, if the biomolecule is a protein expressed by a virus, a cell infected with the 
virus can be contacted with a ligand for the protein. The affect of the binding of the 
ligand to the protein on viral viability can then be assessed. 
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A ligand identified by the method of the invention can also be assessed in an in 
vivo model or in a human. For example, the ligand can be evaluated in an animal or 
organism which produces the biomolecule. Any resulting change in the health status 
(e.g., disease progression) of the animal or organism can be determined. 

For a biomolecule, such as a protein or a nucleic acid molecule, of unknown 
function, the effect of a ligand which binds to the biomolecule on a cell or organism 
which produces the biomolecule can provide information regarding the biological 
function of the biomolecule. For example, the observation that a particular cellular 
process is inhibited in the presence of the ligand indicates that the process depends, at 
least in part, on the function of the biomolecule. 

Ligands identified using the methods of the invention can also be used as affinity 
reagents for the biomolecule to which they bind. In one embodiment, such ligands are 
used to effect affinity purification of the biomolecule, for example, via chromatography 
of a solution comprising the biomolecule using a solid phase to which one or more such 
ligands are attached. 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application, as well as the Figures and the Sequence 
Listing, are hereby incorporated in reference. 

Examples 



Example 1 : Synthesis and Characterization of a library on the order of 10 5 members 

The synthesis of a library comprising on the order of 10 5 distinct members was 
accomplished using the following reagents: 

Compound 1 : 



I ^O-TGACTCCCAAATCAATGTG-3' 



HoN 




Q / v "0-ACTGAGGGTTTAGTTAC-P0 4 -5' 



-37- 



WO 2005/058479 



PCT/US2004/042964 



Single letter codes for deoxyribonucleotides: 
A = adenosine 
C = cytidine 
G = guanosine 
T = thymidine 



Building block precursors: 
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Fmoc 




OH 



Fmoc O 



Fmoc 




OH 



BB10 



BB11 



BB12 



Oligonucleotide tags: 

Sequence Tag number 

5 ' - P0 4 - GCAACGAAG (SEQ ID NO:l) 1.1 
ACCGTTGCT-PO3-5' (SEQ ID NO: 2) 

5 ' - P0 3 - GCGTACAAG (SEQ ID NO : 3 ) 1.2 

ACCGCATGT-PO3-5' (SEQ ID NO: 4) 

5' -PO3-GCTCTGTAG (SEQ ID NO : 5) 1.3 

ACCGAGACA- PO3 - 5 ' (SEQ ID NO : 6.) 

5 ' - P0 3 - GTGCCATAG (SEQ ID NO:7) 1.4 

ACCACGGTA- PO3 - 5 ' (SEQ ID NO : 8 ) 

5 ' - P0 3 - GTTGACCAG (SEQ ID NO:9) 1.5 

ACCAACTGG- PO3 - 5 ' (SEQ ID NO: 10) 

5' -PO3-CGACTTGAC (SEQ ID NO: 11) 1.6 

CAAGTCGCA-P0 3 -5' (SEQ ID NO: 12) 

5 ' - PO3 - CGTAGTCAG (SEQ ID NO: 13) 1.7 

ACGCATCAG-P03-5 , (SEQ ID NO: 14) 

5 7 - PO3 - CCAGCATAG (SEQ ID NO:15) 1.8 

ACGGTCGTA- P0 3 - 5 ' (SEQ ID NO: 16) 

5 ' - PO3 - CCTACAGAG (SEQ ID NO: 17) 1.9 

ACGGATGTC - PO3 - 5 ' (SEQ ID NO: 18) 

5 ' - PO3 - CTGAACGAG (SEQ ID NO: 19) 1.10 

CGTTCAGCA-P0 3 -5' (SEQ ID NO: 20) 

5 ' -PO3-CTCCAGTAG (SEQ ID NO: 21) 1.11 

ACGAGGTCA- PO3 - 5 ' (SEQ ID NO: 22) 
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5 ' - PO3 -TAGGTCCAG (SEQ ID NO: 23) 1.12 

ACATCCAGG-P0 3 -5' (SEQ ID NO: 24) 

5' -PO3-GCGTGTTGT (SEQ ID NO: 25) 2.1 

TCCGCACAA- PO3 - 5 ' (SEQ ID NO: 26) 

5 ' - P0 3 - GCTTGGAGT (SEQ ID NO: 27) 2.2 

TCCGAACCT-P0 3 -5' (SEQ ID NO: 28) 

5' -PO3-GTCAAGCGT (SEQ ID NO: 2 9) 2.3 

TCCAGTTCG - PO3 - 5 ' (SEQ ID NO: 30) 

5 ' - P0 3 - CAAGAGCGT (SEQ ID NO: 31) 2.4 

TCGTTCTCG-P03-5 , (SEQ ID NO: 32) 

5 ' - P0 3 - CAGTTCGGT (SEQ ID NO: 33) 2.5 

TCGTCAAGC-P0 3 -5' (SEQ ID NO: 34) 

5 ' - P0 3 - CGAAGGAGT (SEQ ID NO:35) 2.6 

TCGCTTCCT-P0 3 -5' (SEQ ID NO: 36) 

5 ' - P0 3 - CGGTGTTGT (SEQ ID NO:37) .;. 2.7 

TCGCCACAA-PO3-5' (SEQ ID NO: 38) 

5 ' - PO3 - CGTTGCTGT (SEQ ID NO: 39) 2.8 

TCGCAACGA-P0 3 -5' (SEQ ID NO: 40) 

5 ' - P0 3 - CCGATCTGT (SEQ ID NO:41) 2.9 

TCGGCTAGA- P0 3 - 5 ' (SEQ ID NO: 42) 

5' - PO3-CCTTCTCGT (SEQ ID NO:43) 2.10 

TCGGAAGAG - P0 3 - 5 ' (SEQ ID NO:44) 

5 ' - PO3 - TGAGTCCGT (SEQ ID NO:45) 2.11 

TCACTCAGG-PO3-5' (SEQ ID NO: 46) 

5' -PO3-TGCTACGGT (SEQ ID NO:47) 2.12 

TCAGATTGC - PO3 - 5 ' (SEQ ID NO: 48) 

5 ' - P0 3 -GTGCGTTGA (SEQ ID NO:49) 3.1 

CACACGCAA- PO3 - 5 ' (SEQ ID NO: 50) 

5 ' - PO3 -GTTGGCAGA (SEQ ID NO:51) 3.2 

CACAACCGT - P0 3 - 5 ' (SEQ ID NO: 52) 

5 ' - P0 3 - CCTGTAGGA (SEQ ID NO:53) 3.3 

CAGGACATC-P0 3 -5' (SEQ ID NO: 54) 
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5 ' - P0 3 - CTGCGTAGA (SEQ ID NO:55) 3.4 

CAGACGCAT-P0 3 -5' (SEQ ID NO : 56) 

5 ' - PO3 - CTTACGCGA (SEQ ID NO:57) 3.5 

CAGAATGCG-P0 3 -5' (SEQ ID NO: 58) 

5 ' - P0 3 - TGGTCACGA (SEQ ID NO:59) 3.6 

CAACCAGTG - P0 3 - 5 ' (SEQ ID NO: 60) 

5 ' - P0 3 - TCAGAGCGA (SEQ ID NO:61) 3.7 

CAAGTCTCG-P0 3 -5' (SEQ ID NO: 62) 

5' -PO3-TTGCTCGGA (SEQ ID NO: 63) 3.8 

CAAACGAGC-P0 3 -5' (SEQ ID NO: 64) 

5 ' - P0 3 - GCAGTTGGA (SEQ ID NO:65) 3.9 

CACGTCAAC-P0 3 -5' (SEQ ID NO: 66) 

5 7 -PO3-GCCTGAAGA (SEQ ID NO: 67) 3.10 

CACGGACTT - P0 3 - 5 ' (SEQ ID NO: 68) 

5 ' - PO3 - GTAGCCAGA (SEQ ID NO: 69) 3.11 

CACATCGGT-P0 3 -5' (SEQ ID NO: 70) 

5' -PO3-GTCGCTTGA (SEQ ID NO: 71) 3.12 

CACAGCGAA-PO3-5' (SEQ ID NO: 72) 

5' -PO3-GCCTAAGTT (SEQ ID NO: 73) 4.1 

CTCGGATTC - P0 3 - 5 ' (SEQ ID NO: 74) 

5' -PO3-GTAGTGCTT (SEQ ID NO: 75) 4.2 

CTCATCACG- P0 3 - 5 ' (SEQ ID NO: 76) 

5 ' - P0 3 -GTCGAAGTT (SEQ ID NO: 77) 4.3 

CTCAGCTTC - P0 3 - 5 ' (SEQ ID NO:78) 

5' - PO3-GTTTCGGTT (SEQ ID NO: 79) 4.4 

CTCAAAGCC-P0 3 -5' (SEQ ID NO: 80) 

5 ' - P0 3 - CAGCGTTTT (SEQ ID NO:81) 4.5 

CTGTCGCAA- PO3 - 5 ' (SEQ ID NO: 82) 

5 ' - P0 3 - CATACGCTT (SEQ ID NO:83) 4.6 

CTGTATGCG-PO3-5' (SEQ ID NO: 84) 

5 ' - PO3 - CGATCTGTT (SEQ ID NO:85) 4.7 

CTGCTAGAC - P0 3 - 5 ' (SEQ ID NO: 86) 

5' -PO3-CGCTTTGTT (SEQ ID NO:87) 4.8 

CTGCGAAAC-P0 3 -5' (SEQ ID NO: 88) 
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5 ' - P0 3 - CCACAGTTT (SEQ ID NO:89) 4.9 

CTGGTGTCA-PO3-5' (SEQ ID NO: 90) 

5' - P0 3 -CCTGAAGTT (SEQ ID NO: 91) 4.10 

CTGGACTTC - P0 3 - 5 ' (SEQ ID NO: 92) 

5 ' - P0 3 - CTGACGATT (SEQ ID NO: 93) 4.11 

CTGACTGCT-P0 3 -5' (SEQ ID NO: 94) 

5 ' - PO3 - CTCCACTTT (SEQ ID NO: 95) 4.12 

CTGAGGTGA- PO3 - 5 ' (SEQ ID NO: 96) 

5 ' - P0 3 - ACCAGAGCC (SEQ ID NO:97) 5.1 

AATGGTCTC - PO3 - 5 ' (SEQ ID NO: 98) 

5 ' - P0 3 - ATCCGCACC (SEQ ID NO:99) 5.2 

AATAGGCGT - PO3 - 5 ' (SEQ ID NO: 10 0) 

5 ' - P0 3 - GACGACACC (SEQ ID NO:101) 5.3 

AACTGCTGT - P0 3 - 5 ' (SEQ ID NO: 102) 

5 ' - P0 3 -GGATGGACC (SEQ ID NO:103) 5.4 

AACCTACCT-P03-5 , (SEQ ID NO: 104) 

5 ' - P0 3 -GCAGAAGCC (SEQ ID NO:105) 5.5 

AACGTCTTC- PO3 - 5 ' (SEQ ID NO: 106) 

5' -PO3-GCCATGTCC (SEQ ID NO:107) 5.6 

AACGGTACA- P0 3 - 5 ' (SEQ ID NO: 108) 

5' -PO3-GTCTGCTCC (SEQ ID NO:109) 5.7 

AACAGACGA-P0 3 -5' (SEQ ID NO: 110) 

5 ' - PO3 - CGACAGACC (SEQ ID NO:lll) 5.8 

AAGCTGTCT-P0 3 -5' (SEQ ID NO: 112) 

5' -PO3-CGCTACTCC (SEQ ID NO:113) 5.9 

AAGCGATGA- PO3 - 5 ' (SEQ ID NO: 114) 

5' -PO3- CGACAGACC (SEQ ID NO: 115) 5.10 

AAGGTGTCT - PO3 - 5 ' (SEQ ID NO: 116) 

5' -PO3-CCTCTCTCC (SEQ ID NO: 117) 5.11 

AAGGAGAGA- PO3 - 5 ' (SEQ ID NO: 118) 

5' -PO3-CTCGTAGCC (SEQ ID NO: 119) 5.12 

AAGAGCATC - PO3 - 5 ' (SEQ ID NO: 12 0) 



-42- 



WO 2005/058479 



PCT/US2004/042964 



IX ligase buffer: 50 mM Tris, pH 7.5; 10 mM dithiothreitol; 10 mM MgCl 2 ; 2.5 mM 
ATP; 50 mM NaCl. 

10X ligase buffer: 500 mM Tris, pH 7.5; 100 mM dithiothreitol; 100 mM MgCl 2 ; 25 
mM ATP; 500 mM NaCl 

Cycle 1 

To each of twelve PCR tubes was added 50 (aL of a 1 mM solution of Compound 
1 in water; 75 p,L of a 0.80 mM solution of one of Tags 1.1-1.12; 15 [xL 10X ligase 
buffer and 10 \jlL deionized water. The tubes were heated to 95 °C for 1 minute and then 
cooled to 16 °C over 10 minutes. To each tube was added 5,000 units T4 DNA ligase 
(2.5 ^iL of a 2,000,000 unit/mL solution (New England Biolabs, Cat. No. M0202)) in 50 
jal IX ligase buffer and the resulting solutions were incubated at 16 °C for 16 hours. 

Following ligation, samples were transferred to 1.5 ml Eppendorf tubes and 
treated with 20 juiL 5 M aqueous NaCl and 500 jaL cold (-20 °C) ethanol, and held at -20 v . 
°C for 1 hour. Following centrifugation, the supernatant was removed and the pellet was 
washed with 70% aqueous ethanol at -20 °C. Each of the pellets was then dissolved in 
150 jiL of 150 mM sodium borate buffer, pH 9.4. 

Stock solutions comprising one each of building block precursors BB1 to BB12, 
N,N-diisopropylethanolamine and 0-(7-azabenzotriazol-l-yl)-l,l,3,3- 

tetramethyluronium hexafluorophosphate, each at a concentration of 0.25 M, were 
prepared in DMF and stirred at room temperature for 20 minutes. . The building block 
precursor solutions were added to each of the pellet solutions described above to provide 
a 10-fold excess of building block precursor relative to linker. The resulting solutions 
were stirred. An additional 10 equivalents of building block precursor was added to the 
reaction mixture after 20 minute, and another 10 equivalents after 40 minutes. The final 
concentration of DMF in the reaction mixture was 22%. The reaction solutions were 
then stirred overnight at 4°C. The reaction progress was monitored by RP-HPLC using 
50mM aqueous tetraethylammonium acetate (pH=7.5) and acetonitrile, and a gradient of 
2-46% acetonitrile over 14 min. Reaction was stopped when -95% of starting material 
(linker) is acylated. Following acylation the reaction mixtures were pooled and 
lyophilized to dryness. The lyophilized material was then purified by HPLC, and the 
fractions corresponding to the library (acylated product) were pooled and lyophilized. 
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The library was dissolved in 2.5 ml of 0.01M sodium phosphate buffer (pH = 
8.2) and 0.1ml of piperidine (4% v/v) was added to it. The addition of piperidine results 
in turbidity which does not dissolve on mixing. The reaction mixtures were stirred at 
room temperature for 50 minutes, and then the turbid solution was centrifuged (14,000 
rpm), the supernatant was removed using a 200 fx\ pipette, and the pellet was 
resuspended in 0.1 ml of water. The aqueous wash was combined with the supernatant 
and the pellet was discarded. The deprotected library was precipitated from solution by 
addition of excess ice-cold ethanol so as to bring the final concentration of ethanol in 
the reaction to 70% v/v. Centrifugation of the aqueous ethanol mixture gave a white 
pellet comprising the library. The pellet was washed once with cold 70% aq. ethanol. 
After removal of solvent the pellet was dried in air (~5min.) to remove traces of ethanol 
and then used in cycle 2. The tags and corresponding building block precursors used in 
Round 1 are set forth in Table 1, below. 



Table 1 



Building' 

Block 

Precursor 


Tag 


BB1 


1.11 


BB2 


1.6 


BB3 


1.2 


BB4 


1.8 


BB5 


1.1 


BB6 


1.10 


BB7 


1.12 


BB8 


1.5 


BB9 


1.4 


BB10 


1.3 


BB11 


1.7 


BB12 


1.9 



Cycles 2-5 

For each of these cycles, the combined solution resulting from the previous cycle 
was divided into 12 equal aliquots of 50 ul each and placed in PCR tubes. To each tube 
was added a solution comprising a different tag, and ligation, purification and acylation 
were performed as described for Cycle 1, except that for Cycles 3-5, the HPLC 
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purification step described for Cycle 1 was omitted. The correspondence between tags 
and building block precursors for Cycles 2-5 is presented in Table 2. 

The products of Cycle 5 were ligated with the closing primer shown below, using 
the method described above for ligation of tags. 

5' - P0 3 - GGCACATTGATTTGGGAGTCA 

GTGTAACTAAACCCTCAGT- P0 3 - 5 ' 



Table 2 



Building 

Block 

Precursor 


Cycle 2 
Tag 


Cycle 3 
Tag 


Cycle 4 
Tag 


Cycle 5 
Tag 


BB1 


2.7 


3.7 


4.7 


5.7 


BB2 


2.8 


3.8 


4.8 


5.8 


BB3 


2.2 


3.2 


4.2 


5.2 


BB4 


2.10 


3.10 


4.10 


5.10 


BB5 


2.1 


3.1 


4.1 


5.1 


BB6 


2.12 


3.12 


4.12 


5.12 


BB7 


2.5 


3.5 


4.5 


5.5 


BB8 


2.6 


3.6 


4.6 


5.6 


BB9 


2.4 


3.4 


4.4 


5.4 


BB10 


2.3 


3.3 


4.3 


5.3 


BB11 


2.9 


3.9 


4.9 


5.9 


BB12 


2.11 


3.11 


4.11 


5.11 



Results: 

The synthetic procedure described above has the capability of producing a library 
comprising 12 5 (about 249,000) different structures. The synthesis of the library was 
monitored via gel electrophoresis of the product of each cycle. The results of each of the 
five cycles and the final library following ligation of the closing primer are illustrated in 
Figure 7. The compound labeled "head piece" is Compound 1. The figure shows that 
each cycle results in the expected molecular weight increase and that the products of 
each cycle are substantially homogeneous with regard to molecular weight. 



Example 2: Synthesis and Characterization of a library on the order of 10* members 
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The synthesis of a library comprising on the order of 10 distinct members was 
accomplished using the following reagents: 

Compound 2: 




Single letter codes for deoxyribonucleotides: 
A = adenosine 
C = cytidine 
G = guanosine 
T = thymidine 

Building block precursors: 
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Fmoc- 




Fmoc- N ^-- OH 
H O 



BB25 




1ST XT 
Fmoc OH 

BB26 



0 



CI 



Fmoc-N 



N 



BB23 



Fmoc — N 
H 




OH 



OH 



BB24 



BB27 



NH 

Fmoc 
BB28 



Fmoc- 




OH 



Fmoc- 




Fmoc 




BB31 



Fmoc-N 
H 



OH 



BB32 



OH 




Fmoc 



NH 



OH HO 



Fmoc 
BB33 




BB34 



Fmoc' 




BB35 
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Fmoc ^ 




0 2 N 



Fmoc 



BB36 



or 

BB37 



N C 
Fmoc O 



BB38 



OH 
O 



Fmoc 



Fmoc 



Fmoc 



H 



BB40 



BB41 



v OH 
BB42 



HN 

l i 
Fmoc OH 




Fmoc 



7V"'N' 

k H 

O^OH 



BB44 



BB45 



BB46 



Fmoc 



,N^L^OH 



Fmoc. 





BB48 



x 



BB49 



HN C 
l " 
Fmoc O 



BB50 



HN C 
Fmoc O 



.OH 



Fmoc 



H 




BB39 



Fmoc » 



H 5 



OH 



BB43 



/ 

Fmoc. ^.OH 
H I 



BB47 



Fmoc „ 




-jpOH 

o 



Frroc 



BB52 



BB53 



BB54 



Fmoc ^ /OH 

H » 
M O 



BB55 
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FlTDC ^,>\/OH 



M r 

BB57 




» H c 



Fmoc— N 

HO'^O 
BB58 



o 




FfTDC 



^ „ N y N J ° 

FmDC— NHJC=0 O ^OH 



HO 
BB59 



BB60 BB61 



hh^c-OH k^OH X 




BB69 BB7 ° 
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0 2 N 




FlTDC . 



OH 



NH 2 



Fmoc . 



H 5 

BB71 



tTY 



OH 



N ^C' 

FlTDC O 



,OH 



BB74 



BB72 



Fmoc 



NOc 



O 
BB75 



Fmoc 



BB78 



I ii 

Fmoc O 



BB81 




BB780 



,OH 



BB83 



0 

Fmoc 
BB84 



Fmoc 



BB85 



OH Fmoc. 



H 5 



BB86 



OH 
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Table 3: Oligonucleotide tags used in cycle 1: 



Tag 

Number Top Strand Sequence 
5'-P03- 

AAATCGATGTGGTCACTCAG 

1.1 (SEQ ID NO: 121) 
5'-P03- 

AAATCGATGTGGACTAGGAG 

1.2 (SEQ ID NO: 123) 
5'-P03- 

AAATCGATGTGCCGTATGAG 

1.3 (SEQ ID NO: 125) 
5'-P03- 

AAATCGATGTGCTGAAGGAG 

1.4 (SEQ ID NO: 127) 
5'-P03- 

AAATCGATGTGGACTAGCAG 

1.5 (SEQ ID NO: 12 9) 
5'-P03- 

AAATCGATGTGCGCTAAGAG 

1.6 (SEQ ID NO: 131) 



Bottom Strand Sequence 
5'-P03- 

GAGTGACCACATCGATTTGG 
, (SEQ ID NO: 122) 
5'-P03- 

CCTAGTCCACATCGATTTGG 
(SEQ ID NO: 124) 
5'-P03- 

CATACGGCACATCGATTTGG 
(SEQ ID NO: 12 6) 
5'-P03- 

CCTTCAGCACATCGATTTGG 
(SEQ ID NO: 128) 
5'-P03- 

GCTAGTCCACATCGATTTGG 
(SEQ ID NO:130) 
5'-P03- 

CTTAGCGCACATCGATTTGG 
(SEQ ID NO: 132) 
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5'-P03- 


5'-P03- 




AAATCGATGTGAGCCGAGAG 


CTPGGPTP APA TPG A TTTnn 


1.7 


(SEQ ID NO: 133) 


(SEO ID NO- 134) 




5'-P03- 


5'-P03- 




AAATCGATGTGCCGTATCAG 


GATACGGfArATrGATTTnn 


1.8 


(SEQ ID NO : 135) 


(SEO ID NO • 1"36) 




5'-P03- 


5'-P03- 




AAATCGATGTGCTGAAGCAG 


GCTTC A GP AC A TP O A TTTPP 


1.9 


(SEQ ID NO:137) 


(SEO ID NO • 138 ) 




5'-P03- 


5'-P03- 




AAATCGATGTGTGCGAGTAG 


A PTPOP A P A P A TPH A TTTPtPt 


1.10 


(SEQ ID NO : 139) 


(SEO ID NO ■ 1 4 0 ) 




5'-P03- 


5'-P03- 




AAATCGATGTGTTTGGCGAG 


CGCCAAACACATCGATTTGG 


1.11 


(SEQ ID NO: 141) 


(SEQ ID NO : 142 ) 




5'-P03- 


5'_P03- 




AAATCGATGTGCGCTAACAG 


GTTAGCGCACATCGATTTGG 


1.12 


(SEQ ID NO: 143) 


(SEQ ID NO : 144 ) 




5'-P03- 


5'-P03- 




AAATCGATGTGAGCCGACAG 


GTPGGPTP APA TPO A TTTHP 

VJ J. V-/VJVJ V_x A V_^xTlV_x^\. J. V_/VJ.fA. 1X1 VJVJ 


1.13 


(SEQ ID NO : 145) 


(SEO ID NO- 146) 




5'-P03- 


5'-P03- 




AAATCGATGTGAGCCGAAAG 


TTCGGPTP APA TPG A TTTGG 


1.14 


(SEQ ID NO: 147) 


(SEO ID NO- 14 8) 




5'-P03- 






AAATCGATGTGTCGGTAGAG 


GT A PPG A P A P A TP O A TTTPP 

^ ■•■ i».v>Vyvj/\V'nLV//\ i V/VJ/\ l l 1 vjvJ 


1.15 


(SEQ ID NO: 149) 


(SEO ID NO- ISO) 




5'-P03- 


V-PCH- 




AAATCGATGTGGTTGCCGAG 


PGGP A A PP APA TPO A TTTPtrt 

V-'VJ vj V-zriii^v^/w^ri 1 v_x VJ^Y 1 l 1 VJ vj 


1.16 


(SEQ ID NO: 151) 


(SEO ID NO -IS?) 




5'-P03- 


«J X VJ" J 




AAATCGATGTGAGTGCGTAG 


A POP A PTP APA TPP A TTTPP 
n.v-uV'n.V' x v^Av/A. x v^vJ^A. Ill VjvJ 


1.17 


(SEQ ID NO: 153) 


(SEO TD NO-1R4) 




5'-P03- 






AAATCGATGTGGTTGCCAAG 


TOPP A A PP A P A TPPt A TTTPtPt 

1 vJ\JV>A\/\vyV'A.vy/\ 1 V-xVXrY 111 vJLJ 


1.18 


(SEQ ID NO: 155) 


(SEO ID NO- 1 Rfi) 




5'-P03- 


5'-P03- 




AAATCGATGTGTGCGAGGAG 


GGTPGP A P A P A TPG A TTTPO 

v>\^ 1 VVJ V//l\/ AV^A 1 v/VJ A 1 X X VJ VJ 


1.19 


(SEQ ID NO: 157) 


(SEO ID NO- 158) 




5'-P03- 


5'-P03- 




AAATCGATGTGGAACACGAG 


GGTGTTPP APA TPO A TTTOG 

V_/VJ A VJ X A v^vvA.V/A X V_/VJ^V XXX VJVJ 


1.20 


(SEQ ID NO: 159) 


(SEO ID NO- 160) 




5'-P03- 


5'_P03- 




AAATCGATGTGCTTGTCGAG 


PGAPA AOP A P A TPP A TTTPtP 

v^VJj^V^/^r\.VJv_^/A.v_x/^\. 1 V_^vJ/\ 111 vJvJ 


1.21 


(SEQ ID NO: 161) 


(SEO ID NO -1^9) 




5'-P03- 


«J ~ x v_y .J 




AAATCGATGTGTTCCGGTAG 


AOPPPOA AP AP ATPPtATTTPr: 

/^\J v^ v^VJ VJ/\_rA.V^-^Vv^/A. 1 VwVj/\ 111 VJVJ 


1.22 


(SEQ ID NO: 163) 


(SEQ ID NO: 164) 




5'-P03- 


5'-P03- 




AAATCGATGTGTGCGAGCAG 


GCTCGCACACATCGATTTGG 


1.23 


(SEQ ID NO: 165) 


(SEQ ID NO: 166) 




5'-P03- 


5'-P03- 


1.24 


AAATCGATGTGGTCAGGTAG 


ACCTGACCACATCGATTTGG 
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5'-P03- 


5'-P03- 




AAATCGATGTGGCCTGTTAG 


AACAGGCCACATCGATTTGG 


1.25 


(SEQ ID NO: 169) 


(SEQ ID NO: 170) 




5'-P03- 


5'-P03- 




AAATCGATGTGGAACACCAG 


GGTGTTCCACATCGATTTGG 


1.26 


(SEQ ID NO: 171) 


(SEQ ID NO: 172) 






5'-P03- 




5 ' -P03 - AAATCGATGTGCTTGTCC AG 


GGACAAGCACATCGATTTGG 


1.27 


(SEQ ID NO: 173) 


(SEQ ID NO: 174) 




5'-P03- 


5'-P03- 




AAATCGATGTGTGCGAGAAG 


TCTCGCACACATCGATTTGG 


1.28 


(SEQ ID NO: 175) 


(SEQ ID NO: 176) 




5'-P03- 


5'-P03- 




AAATCGATGTGAGTGCGGAG 


CCGCACTCACATCGATTTGG 


1.29 


(SEQ ID NO: 177) 


(SEQ ID NO:178) 




5'-P03- 


5'-P03- 




AAATCGATGTGTTGTCCGAG 


CGGACAACACATCGATTTGG 


1.30 


(SEQ ID NO: 179) 


(SEQ ID NO: 180) 




5'-P03- 


5'-P03- 




AAATCGATGTGTGGAACGAG 


CGTTCCACACATCGATTTGG 


1.31 


(SEQ ID NO: 181) 


(SEQ ID NO: 182) 




5'-P03- 


5'-P03- 




AAATCGATGTGAGTGCGAAG 


TCGCACTCACATCGATTTGG 


1.32 


(SEQ ID NO: 183) 


(SEQ ID NO: 184) 




5'-P03- 


5'-P03- 




AAATCGATGTGTGGAACCAG 


GGTTCCACACATCGATTTGG 


1.33 


(SEQ ID NO: 185) 


(SEQ ID NO: 186) 




5'-P03- 


5'-P03- 




AAATCGATGTGTTAGGCGAG 


CGCCTAACACATCGATTTGG 


1.34 


(SEQ ID NO: 187) 


(SEQ ID NO: 188) 




5'-P03- 


5'-P03- 




AAATCGATGTGGCCTGTGAG 


CACAGGCCACATCGATTTGG 


1.35 


(SEQ ID NO: 189) 


(SEQ ID NO: 190) 






5'-P03- 




5 ' -P03 -AAATCGATGTGCTCCTGTAG 


ACAGGAGCACATCGATTTGG 


1.36 


(SEQ ID NO: 191) 


(SEQ ID NO: 192) 




5'-P03- 


5'-P03- 




AAATCGATGTGGTCAGGCAG 


GCCTGACCACATCGATTTGG 


1.37 


(SEQ ID NO: 193) 


(SEQ ID NO: 194) 




5'-P03- 


5'-P03- 




AAATCGATGTGGTCAGGAAG 


TCCTGACCACATCGATTTGG 


1.38 


(SEQ ID NO: 195) 


(SEQ ID NO: 196) 




5'-P03- 


5'-P03- 




AAATCGATGTGGTAGCCGAG 


CGGCTACCACATCGATTTGG 


1.39 


(SEQ ID NO: 197) 


(SEQ ID NO: 198) 




5'-P03- 


5'-P03- 




AAATCGATGTGGCCTGTAAG 


TACAGGCCACATCGATTTGG 


1.40 


(SEQ ID NO: 199) 


(SEQ ID NO:200) 




5'-P03- 


5'-P03- 




AAATCGATGTGCTTTCGGAG 


CCGAAAGCACATCGATTTGG 


1.41 


(SEQ ID NO:201) 


(SEQ ID NO:202) 
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5'-P03- 

AAATCGATGTGCGTAAGGAG 

1.42 (SEQ ID NO: 203) 
5'-P03- 

AAATCGATGTGAGAGCGTAG 

1.43 (SEQ ID NO: 205) 
5'-P03- 

AAATCGATGTGGACGGCAAG 

1.44 (SEQ ID NO: 207) 

5 ' -P03 - AAATCG ATGTGCTTTCGC AG 

1.45 (SEQ ID NO:209) 
5'-P03- 

AAATCGATGTGCGTAAGCAG 

1.46 (SEQ ID NO:211) 
5'-P03- 

AAATCGATGTGGCTATGGAG 

1.47 (SEQ ID NO: 213) 
5'-P03- 

AAATCGATGTGACTCTGGAG 

1.48 (SEQ ID NO:215) 



5'-P03- 

CCTTACGCACATCGATTTGG 
(SEQ ID NO: 2 04) 
5'-P03- 

ACGCTCTCACATCGATTTGG 
(SEQ ID NO: 2 06) 
5'-P03- 

TGCCGTCCACATCGATTTGG 
(SEQ ID NO:208) 
5'-P03- 

GCGAAAGCACATCGATTTGG 
(SEQ ID NO: 2 10) 
5'-P03- 

GCTTACGCACATCGATTTGG 
(SEQ ID NO: 2 12) 
5'-P03- 

CCATAGCCACATCGATTTGG 
(SEQ ID NO: 2 14) 
5'-P03- 

CCAGAGTCACATCGATTTGG 
(SEQ ID NO: 216) 



5'-P03-AAATCGATGTGCTGGAAAG 

1.49 (SEQ ID NO:217) 
5'-P03- 

AAATCGATGTGCCGAAGTAG 

1.50 (SEQ ID NO:219) 
5'-P03- 

AAATCGATGTGCTCCTGAAG 

1.51 (SEQ ID NO:221) 
5'-P03- 

AAATCGATGTGTCCAGTCAG 

1.52 (SEQ ID NO: 223) 
5'-P03- 

AAATCGATGTGAGAGCGGAG 

1.53 (SEQ ID NO:225) 
5'-P03- 

AAATCGATGTGAGAGCGAAG 

1.54 (SEQ ID NO: 22 7) 
5'-P03- 

AAATCGATGTGCCGAAGGAG 

1.55 (SEQ ID NO:229) 
5'-P03- 

AAATCGATGTGCCGAAGCAG 

1.56 (SEQ ID NO:231) 
5'-P03- 

AAATCGATGTGTGTTCCGAG 

1.57 (SEQ ID NO: 233) 
5'-P03- 

AAATCGATGTGTCTGGCGAG 

1.58 (SEQ ID NO:235) 
5'-P03- 

1.59 AAATCGATGTGCTATCGGAG 



5'-P03- 

TTCCAGCACATCGATTTGG 
(SEQ ID NO:218) 
5'-P03- 

ACTTCGGCACATCGATTTGG 
(SEQ ID NO:220) 
5'-P03- 

TCAGGAGCACATCGATTTGG 
(SEQ ID NO:222) 
5'-P03- 

GACTGGACACATCGATTTGG 
(SEQ ID NO:224) 
5'-P03- 

CCGCTCTCACATCGATTTGG 
(SEQ ID NO:226) 
5'-P03- 

TCGCTCTCACATCGATTTGG 
(SEQ ID NO: 22 8) 
5'-P03- 

CCTTCGGCACATCGATTTGG 
(SEQ ID NO:230) 
5'-P03- 

GCTTCGGCACATCGATTTGG 
(SEQ ID NO:232) 
5'-P03- 

CGGAACACACATCGATTTGG 
(SEQ ID NO:234) 
5'-P03- 

CGCCAGACACATCGATTTGG 
(SEQ ID NO:236) 
5'-P03- 

CCGATAGCACATCGATTTGG 
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(SEQ ID NO:237) 


(SEQ ID NO:238) 




J -ruj- 






A A A TPPr A TCYTC±C*C± AAA dtCl A O 

-TY/Aj'A. 1 V_xVJ/\ 1U1 VJV^VJ/\/\/\UvJ/vVJ 


rrTTTrnr apa tc^ci a tttPtPt 

Vw-^ ill ^-vJv^/ vv^y v i ^vjy v ill UvJ 


1 £0 


V OILV^ X U 1\I VJ . Z. -5 ^ 






J -Jr L) J- 


^' pn^ 




AAA 1 LuA lul LrCULr AAvj AAu 


TPTTpPPP APA TPP A TTTPP 

1 tl 1 L^vjLjU AC A 1 tuA 111 kj{j 


1 /C 1 

l.ol 


/ o "CO T "Hi Vr/""* . /i -i \ 

VoJiy -i- -LJ JNJvJ:^41; 


VoJcj^ ID JNIU:z4z; 






j -1UJ- 




a a a tpp a tptppttpp at a p 
AAA 1 CvjA 1 Lr 1 LrLr 1 IvjwVvjAvj 


PTPP A A PP APA TPP A TTTPP 

L 1 uL AALLAL A 1 LuA 1 1 ICjCj 


1 /CO 

1.62 


vbiiQ ID JNJ(J:^43; 


VoJiQ ID JNIU:z44; 




^ ' DAI 

J -rvJ3- 






A A A TPP A TPTPP A TP P TP A P 

AAA 1 CLrA 1 Lr 1 VjLrA 1 LrLr 1 LjALj 


P A PP A TPP APA TPP A TTTPP 

CAUL, A 1 LLAtA 1 CUA 111 Kj\j 


1 /C3 
1.63 


/ C 1 T7P. T n TvT/^ . O A C \ 


^bhi^ ID JMU:z46; 




5 -rU3- 


J -rU3- 




A A A TPP A TPTPPT A TPPP A P 

AAA 1 CLiA 1 Lr 1 LrC 1 A 1 LuLAu 


PPP ATAPPAPA TPP A TTTPP 

LrCCj A 1 ACjCAC A 1 CLrA 111 kjKj 


1 £ A 

1.64 


/ C? T?r\ T n . n /I "7 \ 

Voliy ID JNIU:-d4/; 






3 -rvJ3- 






A A A TPP A TPTP PP A A ATP AP 

AAA 1 UvjA lul utuAAAutAU 


PPTTTPPP APA TPP A TTTPP 

IjC 111 LuLALA 1 tuA 111 KjKj 


i /cc 

1 .OJ 


/ C TTT^ Tn lVT^ ■ O ^1 Q \ 


( CITP TD TvTO . O CI Pi \ 










a a a tpp a tptp apa pTPir, a p 
AAA I tuA lul uALAL 1 LrLrALr 


pp A PTPTP APA TPP A TTTPP 

ULAu l\j 1 LALA 1 tuA 111 LrVj 


1 .DO 


VoJiv -1-D 1NU : zol J 


/ cup tv\ ~Kir\ . o c: o \ 
loiiv ±D JN<j : z dz / 




3 -.rvJ:)- 


J -rvJj- 




A A A TPP A TPTPTPTPPP A A P 
AAA 1 LAjA 1 Lr 1 Lr 1 1 uuLAAu 


TP PP A P A P A P A TPP A TTTPP 

1 ULLAuALALA I LAjA 1 1 ILtvj 


1 an 
l.o/ 


( C "C 1 C\ TTl "MO . O CI O \ 

VoJcj^ ID JNU : ZD j J 


VoUj^ -LD JNU:zD4y 




3 -rUj- 


J -rU3- 




A A A TPH A TPTPP A TPiP^TP A P 
AAA 1 LuA 1 Lr 1 uuA 1 LrLr 1 tAu 


p A PP A TPP APA TPP A TTTPP 
uAttA 1 LLALA 1 LuA 111 KjKj 


1 /CO 




/CUP T n "NTP\ . o c c \ 

VbiiQ ID JNJUizob; 




3 -rvjj- 


J -rU3- 




A A A TPP A TPTPPTTPP A P A P 

AAA 1 CLrA 1 Lr 1 LrLr 1 1 Lrv^AL, ALr 


PTPP A A PP APA TPP A TTTPP 

Cj 1 uLAALCAtA 1 UvjA 111 uu 


1 /Cft 

1.69 


^bhiQ ID JMU:zb/; 


^bhiC^ ID JNUizboj 




J -rU3- 


r ? TJPil PP A TPPPPP A TPPP A 




A A A TPP A Ti^T^r^r^/^ 1 A TPP A P 

AAA 1 tuA 1 Lr 1 LrLiLrL. A 1 CLrALr 


TTT PP 
111 \3\J 


1 

1.70 


^biiy ID JMU:zoy; 


( c? "c r\ Tn ts.tp\ . o a c\ \ 
ibhiy ID JNIUrzoU; 






^' pn^ 






PP A PPP A P A P A TPP A TTTPP 
OUAuULAuALA 1 LUA 111 vjvj 


1 71 


V Oi-jy J.LJ 1NU . Z D 1 / 


/cup Tn 










aaa Trr; a TOTnTnr , r i TP a ah 


Tn a oprp apapa Trn a tttoo 


1 10 


\OD y^J -L i-J IN v_/ . O j / 


( QPD TD 




j -rUj- 


^' T>P*1 
D -i vJJ- 




a a a nrr^ri a TPiTnnnr a tpp a n 

AAA 1 C-ljA lul uuutA 1 LLAu 


pp A TPPPP APA TPP A TTTPP 

vjVjA 1 uLLLALA 1 CvjA ill Vjvj 


1 ni 

1.73 


/ ClT?r\ TT\ TvTP> . OCC^ 

\ £>hj\2 -ID JNU: zoo j 


( C2~u?r\ t n ~k\c\ » ^ a a \ 
^biiy ID JMUl^bb; 




j> -r(J3- 


C 5 DPil TP A TPPPP A PAT PP A 

D -rCJ3-l vjAI VjUCCA LAI LLrA 




A A A TPP A TPTPPPP A TP A AP 

AAA 1 CCjA 1 Cj 1 OCjCjC A 1 C AACj 


TTT* P* t ' 
111 VJVJ 


1.74 


( bliy ±D JMO:zo7; 


/ O TT 1 /^ Tn "KT/*\ . "~> O \ 

VbfciQ ID rslL):26o; 






C 5 DAO PP A PAP PPA PAT 

3 -rVJ J -LLrA LALr VjCA LAI 




AAATCGATGTGCCTGTCGAG 


CGA TTT GG 


1.75 


(SEQ ID NO:269) 


(SEQ ID NO:270) 




5'-P03- 


5'-P03-ATC CGT CCA CAT 




AAATCGATGTGGACGGATAG 


CGA TTT GG 


1.76 


(SEQ ID NO:271) 


(SEQ ID NO: 272) 
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5'-P03- 

AAATCGATGTGCCTGTCCAG 

1.77 (SEQ ID NO: 273) 
5'-P03- 

AAATCGATGTGAAGCACGAG 

1.78 (SEQ ID NO:275) 
5'-P03- 

AAATCGATGTGCCTGTCAAG 

1.79 (SEQ ID NO: 277) 
5'-P03- 

AAATCGATGTGAAGCACCAG 

1.80 (SEQ ID NO:279) 

5 ' -P03 - AAATCGATGTGCCTTCGTAG 

1.81 (SEQ ID NO:281) 
5'-P03- 

AAATCGATGTGTCGTCCGAG 

1.82 (SEQ ID NO:283) 
5'-P03- 

AAATCGATGTGGAGTCTGAG 

1.83 (SEQ ID NO:285) 
5'-P03- 

AAATCGATGTGTGATCCGAG 

1.84 (SEQ ID NO:287) 



5'-P03-GGA CAG GCA CAT 
CGA TTT GG 

(SEQ ID NO: 2 74) 
5 '-P03-CGT GCT TCA CAT 
CGA TTT GG 

(SEQ ID NO:276) 
5'-P03-TGA CAG GCA CAT 
CGA TTT GG 

(SEQ ID NO: 2 78) 
5 '-P03-GGT GCT TCA CAT 
CGA TTT GG 

(SEQ ID NO: 2 80) 
5'-P03-ACG AAG GCA CAT 
CGA TTT GG 

(SEQ ID NO:282) 
5'-P03-CGG ACG ACA CAT 
CGA TTT GG 

(SEQ ID NO:284) 
5'-P03-CAG ACT CCA CAT 
CGA TTT GG 

(SEQ ID NO:286) 
5'-P03-CGG ATC ACA CAT 
CGA TTT GG 

(SEQ ID NO: 2 88) 



5'-P03- 

AAATCGATGTGTCAGGCGAG 

1.85 (SEQ ID NO:289) 
5'-P03- 

AAATCGATGTGTCGTCCAAG 

1.86 (SEQ ID NO: 291) 
5'-P03- 

AAATCGATGTGGACGGAGAG 

1.87 (SEQ ID NO: 293) 
5'-P03- 

AAATCGATGTGGTAGCAGAG 

1.88 (SEQ ID NO: 295) 
5'-P03- 

AAATCGATGTGGCTGTGTAG 

1.89 (SEQ ID NO:297) 
5'-P03- 

AAATCGATGTGGACGGACAG 

1.90 (SEQ ID NO: 299) 
5'-P03- 

AAATCGATGTGTCAGGCAAG 

1.91 (SEQ ID NO:301) 
5'-P03- 

AAATCGATGTGGCTCGAAAG 

1.92 (SEQ ID NO: 3 03) 
5'-P03- 

AAATCGATGTGCCTTCGGAG 

1.93 (SEQ ID NO: 305) 
5'-P03- 

1.94 AAATCGATGTGGTAGCACAG 



5'-P03-CGC CTG ACA CAT 
CGA TTT GG 

(SEQ ID NO:290) 
5'-P03-TGG ACG ACA CAT 
CGA TTT GG 

(SEQ ID NO: 2 92) 
5'-P03-CTC CGT CCA CAT 
CGA TTT GG 

(SEQ ID NO:294) 
5'-P03-CTG CTA CCA CAT 
CGA TTT GG 

(SEQ ID NO:296) 

5'-P03- 

ACACAGCCACATCGATTTGG 

(SEQ ID NO:298) 
5'-P03-GTC CGT CCA CAT 
CGA TTT GG 

(SEQ ID NO:300) 
5'-P03-TGC CTG ACA CAT 
CGA TTT GG 

(SEQ ID NO: 3 02) 

5'-P03- 

TTCGAGCCACATCGATTTGG 

(SEQ ID NO:304) 
5'-P03-CCG AAG GCA CAT 
CGA TTT GG 

(SEQ ID NO:306) 
5'-P03-GTG CTA CCA CAT 
CGA TTT GG 
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(SEQ ID NO:307) 
5'-P03- 

AAATCGATGTGGAAGGTCAG 
(SEQ ID NO:309) 
5'-P03- 

AAATCGATGTGGTGCTGTAG 
(SEQ ID NO: 311) 



(SEQ ID NO:308) 

5'-P03-GAC CTT CCA CAT 
CGA TTT GG 
(SEQ ID NO:310) 
5'-P03-ACA GCA CCA CAT 
CGA TTT GG 
(SEQ ID NO: 312) 



Table 4: Oligonucleotide tags used in cycle 2: 



Tag 

Number 

2.1 

2.2 

2.3 

2.4 

2.5 

2.6 

2.7 

2.8 

2.9 

2.10 

2.11 

2.12 

2.13 

2.14 

2.15 

2.16 

2.17 

2.18 

2.19 



Top strand sequence 



Bottom strand sequence 



5'-P03-GTT GCC TGT 

(SEQ ID NO:313) 
5'-P03-CAG GAC GGT 

(SEQ ID NO: 3 15) 
5 '-P03-AGA CGT GGT 

(SEQ ID NO:317) 
5'-P03-CAG GAC CGT 

(SEQ ID NO:319) 
5'-P03-CAG GAC AGT 

(SEQ ID NO:321) 
5 '-P03-CAC TCT GGT 

(SEQ ID NO:323) 
5'-P03-GAC GGC TGT 

(SEQ ID NO:325) 
5'-P03-CACTCTCGT 

(SEQ ID NO:327) 
5'-P03-GTAGCCTGT 

(SEQ ID NO:329) 
5'-P03-GCC ACT TGT 

(SEQ ID NO:331) 
5'-P03-CATCGCTGT 

(SEQ ID NO:333) 
5'-P03-CAC TGG TGT 

(SEQ ID NO: 335) 



5'-P03-AGG CAA CCT 
(SEQ ID NO:314) 
5'-P03-CGTCCT GCT 
(SEQ ID NO: 316) 
5'-P03-CACGTCTCT 

(SEQ ID NO:318) 
5'-P03-GGTCCTGCT 
(SEQ ID NO:320) 
5'-P03-TGT CCT GCT 
(SEQ ID NO: 322) 
5'-P03-CAG AGT GCT 
(SEQ ID NO:324) 
5'-P03-AGCCGTCCT 
(SEQ ID NO:326) 
5'-P03-GAG AGT GCT 
(SEQ ID NO:328) 
5'-P03-AGGCTACCT 

(SEQ ID NO:330) 
5'-P03-AAG TGG CCT 

(SEQ ID NO:332) 
5'-P03-AGC GAT GCT 

(SEQ ID NO:334) 
5'-P03-ACC AGT GCT 

(SEQ ID NO: 336) 



5'-P03-GCC ACT GGT 

(SEQ ID NO: 337) 
5'-P03-TCTGGCTGT 

(SEQ ID NO:339) 
5'-P03-GCC ACT CGT 

(SEQ ID NO:341) 
5'-P03-TGC CTC TGT 

(SEQ ID NO: 343) 
5'-P03-CAT CGC AGT 

(SEQ ID NO:345) 
5'-P03-CAG GAA GGT 

(SEQ ID NO:347) 
5'-P03-GGC ATC TGT 

(SEQ ID NO:349) 



5'-P03-CAG TGG CCT 

(SEQ ID NO:338) 
5'-P03-AGC CAG ACT 

(SEQ ID NO:340) 
5'-P03-GAG TGG CCT 

(SEQ ID NO:342) 
5'-P03-AGA GGC ACT 

(SEQ ID NO: 344) 
5'-P03-TGC GAT GCT 

(SEQ ID NO: 34 6) 
5'-P03-CTT CCT GCT 

(SEQ ID NO:348) 
5'-P03-AGA TGC CCT 

(SEQ ID NO:350) 



-58- 



WO 2005/058479 



PCT/US2004/042964 



2.20 
2.21 
2.22 
2.23 
2.24 
2.25 
2.26 
2.27 
2.28 
2.29 
2.30 
2.31 
2.32 
2.33 
2.34 
2.35 
2.36 
2.37 
2.38 
2.39 
2.40 
2.41 
2.42 
2.43 
2.44 
2.45 



5 '-P03-CGG TGC TGT 5 '-P03 

(SEQ ID NO:351) (SEQ 

5'-P03-CACTGGCGT 5'-P03 

(SEQ ID NO: 353) (SEQ 

5>_P03-TCTCCTCGT 5'-P03 

(SEQ ID NO:355) (SEQ 

5 '-P03-CCT GTC TGT 5 '-P03 

(SEQ ID NO:357) (SEQ 

5 '-P03-CAA CGC TGT 5 '-P03 

(SEQ ID NO:359) (SEQ 



-AGC ACC GCT 

ID NO:352) 
-GCC AGT GCT 

ID NO: 354) 
-GAGGAGACT 

ID NO: 356) 
-AGA CAG GCT 

ID NO:358) 
-AGC GTT GCT 

ID NO:360) 



5 '-P03-TGC CTC GGT 5 '-P03- 

(SEQ ID NO: 361) (SEQ 

5'-P03-ACACTGCGT 5'-P03- 

(SEQ ID NO:363) (SEQ 

5 '-P03-TCG TCC TGT 5 '-P03- 

(SEQ ID NO:365) (SEQ 

5 '-P03-GCT GCC AGT 5 '-P03- 

(SEQ ID NO: 367) (SEQ 

5 '-P03-TCA GGC TGT 5 '-P03- 

(SEQ ID NO: 369) (SEQ 

5 '-P03-GCC AGG TGT 5 '-P03 

(SEQ ID NO: 371) (SEQ 

5 '-P03-CGG ACC TGT 5 '-P03- 

(SEQ ID NO:373) ■ (SEQ 

5'-P03-CAA CGC AGT 5'-P03 

(SEQ ID NO: 375) (SEQ 

5 '-P03-CAC ACG AGT 5 '-P03 

(SEQ ID NO: 377) (SEQ 

5 '-P03-ATG GCC TGT 5 '-P03 

(SEQ ID NO: 379) (SEQ 

5 '-P03-CCA GTC TGT 5 '-P03 

(SEQ ID NO:381) (SEQ 

5'-P03-GCCAGGAGT 5'-P03 

(SEQ ID NO:383) (SEQ 



CGA GGC ACT 
ID NO: 362) 
GCA GTG TCT 
ID NO: 364) 
AGG ACG ACT 
ID NO:366) 
TGG CAG CCT 
ID NO:368) 
AGC CTG ACT 
ID NO:370) 
ACC TGG CCT 
ID NO: 372) 
AGG TCC GCT 
ID NO: 3 74) 
TGC GTT GCT 
ID NO: 376) 
TCG TGT GCT 
ID NO:378) 
AGG CCA TCT 
ID NO:380) 
•AGA CTG GCT 
ID NO: 382) 
TCC TGG CCT 
ID NO:384) 



5 '-P03-CGG ACC AGT 5 '-P03 

(SEQ ID NO: 385) (SEQ 

5 '-P03-CCT TCG CGT 5 '-P03 

(SEQ ID NO: 387) (SEQ 

5 '-P03-GCA GCC AGT 5 '-P03 

(SEQ ID NO: 389) (SEQ 

5'-P03-CCAGTCGGT 5'-P03 

(SEQ ID NO:391) (SEQ 

5 '-P03 -ACT GAG CGT 5 ' -P03 

(SEQ ID NO:393) (SEQ 

5 '-P03-CCA GTC CGT 5 '-P03 

(SEQ ID NO: 3 95) (SEQ 

5 '-P03-CCA GTC AGT 5 '-P03 

(SEQ ID NO:397) (SEQ 

5 '-P03-CAT CGA GGT 5 '-P03 

(SEQ ID NO:399) (SEQ 

5 '-P03-CCA TCG TGT 5 '-P03 

(SEQ ID NO:401) (SEQ 



TGG TCC GCT 
ID NO:386) 
GCG AAG GCT 
ID NO:388) 
TGG CTG CCT 
ID NO:390) 
CGA CTG GCT 
ID NO:392) 
GCT CAG TCT 
ID NO: 394) 
-GGA CTG GCT 
ID NO: 3 96) 
-TGA CTG GCT 
ID NO:398) 
-CTC GAT GCT 
ID NO:400) 
ACG ATG GCT 
ID NO:402) 
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2.46 
2.47 



2.48 
2.49 
2.50 
2.51 
2.52 
2.53 
2.54 
2.55 
2.56 
2.57 
2.58 
2.59 



2.60 
2.61 
2.62 
2.63 
2.64 
2.65 
2.66 
2.67 
2.68 
2.69 
2.70 
2.71 



5'-P03-GTG CTG CGT 5'-P03 

(SEQ ID NO:403) (SEQ 

5'-P03-GACTACGGT 5'-P03 

(SEQ ID NO:405) (SEQ 

5 '-P03-GTG CTG AGT 5 '-P03 

(SEQ ID NO:407) (SEQ 



-GCA GCA CCT 
ID NO:404) 

-CGT AGT CCT 
ID NO: 4 06) 

-TCA GCA CCT 
ID NO: 4 08) 



5'-P03-GCTGCATGT 5'-P03 

(SEQ ID NO: 4 09) (SEQ 

5 '-P03-GAGTGGTGT 5'-P03 

(SEQ ID NO:411) (SEQ 

5'-P03-GACTACCGT 5'-P03 

(SEQ ID NO:413) (SEQ 

5'-P03-CGGTGATGT 5'-P03 

(SEQ ID NO:415) (SEQ 

5'-P03-TGCGACTGT 5'-P03 

(SEQ ID NO:417) (SEQ 

5'-P03-TCTGGAGGT 5'-P03 

(SEQ ID NO:419) (SEQ 

5'-P03-AGCACTGGT 5'-P03 

(SEQ ID NO:421) (SEQ 

5'-P03-TCGCTTGGT 5'-P03 

(SEQ ID NO:423) (SEQ 

5'-P03-AGCACTCGT 5'-P03 
(SEQ ID NO:425) : (SEQ 

5 ' -P03 -GCG ATTGGT 5 ? -P03 

(SEQ ID NO:427) (SEQ 

5'-P03-CCATCGCGT 5'-P03 

(SEQ ID NO:429) (SEQ 

5'-P03-TCGCTTCGT 5'-P03 

(SEQ ID NO:431) (SEQ 



-ATGCAGCCT 

ID NO:410) 
-ACCACTCCT 

ID NO: 4 12) 
-GGTAGTCCT 

ID NO:414) 
-ATCACCGCT 

ID NO: 4 16) 
-AGTCGCACT 

ID NO:418) 
-CTCCAGACT 

ID NO:420) 
-CAGTGCTCT 

ID NO:422) 
-CAAGCGACT 

ID NO:424) 
-GAGTGCTCT 

ID NO:426) 
■CAATCGCCT 

ID NO:428) 
•GCGATGGCT 

ID NO:430) 
GAAGCGACT 
ID NO: 432) 



5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 



-AGTGCCTGT 

ID NO:433) 
-GGCATAGGT 

ID NO:435) 
-GCGATTCGT 

ID NO:437) 
-TGCGACGGT 

ID NO:439) 
-GAGTGGCGT 

ID NO: 441) 
-CGGTGAGGT 

ID NO: 443) 
-GCTGCAAGT 

ID NO: 44 5) 
-TTCCGCTGT 

ID NO:447) 
-GAGTGGAGT 

ID NO:449) 
ACAGAGCGT 

ID NO:451) 
TGCGACCGT 
ID NO:453) 



5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 



-AGGCACTCT 

ID NO:434) 
-CTATGCCCT 

ID NO:436) 
-GAATCGCCT 

ID NO:438) 
-CGTCGCACT 

ID NO: 44 0) 
-GCCACTCCT 

ID NO: 442) 
-CTCACCGCT 

ID NO: 444) 
-TTGCAGCCT 

ID NO:446) 
AGCGGAACT 

ID NO:448) 
•TCCACTCCT 

ID NO:450) 
GCTCTGTCT 

ID NO: 452) 
GGTCGCACT 
ID NO: 454) 
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5 ' -P03 -CCTGT AGGT 


5 ' -P03 -CT AC AGGCT 


2.72 


(SEQ ID NO:455) 


(SEQ ID NO:456) 




5 ' -P03 -T AGCCGTGT 


5 ' -P03 - ACGGCT ACT 


2.73 


(SEQ ID NO: 4 57) 


(SEQ ID NO:458) 




5 ' -P03 -TGCGACAGT 


5 ' -P03 -TGTCGC ACT 


2.74 


(SEQ ID NO: 4 59) 


(SEQ ID NO:460) 




5 ' -P03 -GGTCTGTGT 


5 ' -P03 - AC AG ACCCT 


2.75 


(SEQ ID NO: 4 61) 


(SEQ ID NO:462) 




5 ' -P03 -CGGTGAAGT 


5 ' -P03 -TTC ACCGCT 


2.76 


(SEQ ID NO:463) 


(SEQ ID NO: 4 64) 




5 ' -P03 -C AACG AGGT 


5 ' -P03 -CTCGTTGCT 


2.77 


(SEQ ID NO:465) 


(SEQ ID NO:466) 




5 ' -P03 -GCAGCATGT 


5 ' -P03 - ATGCTGCCT 


2.78 


(SEQ ID NO: 467) 


(SEQ ID NO: 4 68) 




5 '-P03 -TCGTCAGGT 


5 ' -P03 -CTG ACG ACT 


2.79 


(SEQ ID NO: 469) 


(SEQ ID NO: 4 70) 




5 ' -P03 -AGTGCCAGT 


5 ' -P03 -TGGC ACTCT 


2.80 


(SEQ ID NO: 4 71) 


(SEQ ID NO: 4 72) 




5 ' -P03 -T AG AGGCGT 


5 ' -P03 -GCCTCT ACT 


2.81 


(SEQ ID NO: 473) 


(SEQ ID NO:474) 




5 ' -P03 -GTC AGCGGT 


5'-P03-CGCTGACCT 


2.82 


(SEQ ID NO:475) 


(SEQ ID NO:476) 




5 ' -P03 -TC AGG AGGT 


5 ' -P03 -CTCCTG ACT 


2.83 


(SEQ ID NO:477) 


(SEQ ID NO: 478) 




5 ' -P03 -AGCAGGTGT 


5'-P03-ACCTGCTCT 


2.84 


(SEQ ID NO: 479 


(SEQ ID NO:480) 




5 ' -P03 -TTCCGC AGT 


5 ' -P03 -TGCGGAACT 


2.85 


(SEQ ID NO: 481) 


(SEQ ID NO:482) 




5 ' -P03 -GTC AGCCGT 


5 ' -P03 -GGCTG ACCT 


2.86 


(SEQ ID NO: 4 83) 


(SEQ ID NO:484) 




5 '-P03 -GGTCTGCGT 


5 ' -P03 -GC AG ACCCT 


2.87 


(SEQ ID NO:485) 


(SEQ ID NO: 486) 




5 ' -P03 -TAGCCG AGT 


5 9 -P03 -TCGGCT ACT 


2.88 


(SEQ ID NO: 4 87) 


(SEQ ID NO: 488) 




5 ' -P03 -GTC AGC AGT 


5 '-P03-TGCTGACCT 


2.89 


(SEQ ID NO: 489) 


(SEQ ID NO: 4 90) 




5 ' -P03 -GGTCTG AGT 


5 ' -P03 -TC AG ACCCT 


2.90 


(SEQ ID NO: 4 91) 


(SEQ ID NO: 4 92) 




5 ' -P03 -CGG AC AGGT 


5 ' -P03 -CTGTCCGCT 


2.91 


(SEQ ID NO: 4 93) 


(SEQ ID NO: 4 94) 




5 ' -P03 -TT AGCCGGT5 ' - 


5 ' -P03 -CGGCT AACT5 ' -P03 - 




P03-3' 


3' 


2.92 


(SEQ ID NO: 4 95) 


(SEO ID NO: 4 96) 




5 '-P03-GAGACGAGT 


5 '-P03-TCGTCTCCT 


2.93 


(SEQ ID NO:497) 


(SEQ ID NO: 4 98) 




5 ' -P03 -CGT AACCGT 


5 ' -P03 -GGTT ACGCT 


2.94 


(SEQ ID NO:499) 


(SEQ ID NO: 500) 




5'-P03-TTGGCGTGT5'- 


5 ' -P03 -ACGCCAACT5 ' -P03 - 




P03-3' 


3' 


2.95 


(SEQ ID NO:501) 


(SEQ ID NO: 502) 




5'-P03-ATGGCAGGT 


5 ' -P03 -CTGCC ATCT 


2.96 


(SEQ ID NO: 503) 


(SEQ ID NO: 504) 
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Table 5. Oligonucleotide tags used in cycle 3 



Tag 
number 



Top strand sequence 



Bottom strand 
sequence 



5'-P03-CAG CTA CGA 

3.1 (SEQ ID NO:505) 
5'-P03-CTC CTG CGA 

3.2 (SEQ ID NO:507) 
5'-P03-GCT GCC TGA 

3.3 (SEQ ID NO:509) 
5'-P03-CAG GAA CGA 

3.4 (SEQ ID NO:511) 
5'-P03-CAC ACG CGA 

3.5 (SEQ ID NO: 513) 
5'-P03-GCA GCC TGA 

3.6 (SEQ ID NO: 515) 
5'-P03-CTG AAC GGA 

3.7 (SEQ ID NO: 517) 
5'-P03-CTG AAC CGA 

3.8 (SEQ ID NO: 519) 
5'-P03-TCT GGA CGA 

3.9 (SEQ ID NO: 521) 
5'-P03-TGC CTA CGA 

3.10 (SEQ ID NO:523) 
5'-P03-GGC ATA CGA 

3.11 (SEQ ID NO:525) 
5'-P03-CGG TGA CGA 

3.12 (SEQ ID NO: 527) 



5'-P03-GTA GCT GAC 
(SEQ ID NO: 506) 
5'-P03-GCA GGA GAC 
(SEQ ID NO: 508) 
5'-P03-AGG CAG CAC 
(SEQ ID NO: 510) 
5'-P03-GTTCCTGAC 
(SEQ ID NO: 512) 
5'-P03-GCG TGT GAC 
(SEQ ID NO: 514) 
5'-P03-AGG CTG CAC 
(SEQ ID NO:516) 
5'-P03-CGTTCAGAC 
(SEQ ID NO:518) 
5'-P03-GGT TCA GAC 
(SEQ ID NO:52 0) 
5'-P03-GTC CAG AAC 
(SEQ ID NO: 522) 
5'-P03-GTA GGC AAC 
(SEQ ID NO: 524) 
5'-P03-GTA TGC CAC 
(SEQ ID NO: 526) 
5'-P03-GTC ACC GAC 
(SEQ ID NO: 528) 



5'-P03-CAA CGA CGA 

3.13 (SEQ ID NO:529) 
5'-P03-CTC CTC TGA 

3.14 (SEQ ID NO:531) 
5'-P03-TCA GGA CGA 

3.15 (SEQ ID NO:533) 
5'-P03-AAA GGC GGA 

3.16 (SEQ ID NO: 535) 
5'-P03-CTCCTCGGA 

3.17 (SEQ ID NO: 537) 
5'-P03-CAG ATG CGA 

3.18 (SEQ ID NO: 539) 
5'-P03-GCA GCA AG A 

3.19 (SEQ ID NO: 541) 
5'-P03-GTG GAG TGA 

3.20 (SEQ ID NO: 54 3) 
5'-P03-CCA GTA GGA 

3.21 (SEQ ID NO: 545) 
5'-P03-ATG GCA CGA 

3.22 (SEQ ID NO: 54 7) 



5'-P03-GTC GTT GAC 
(SEQ ID NO:530) 
5'-P03-AGA GGA GAC 
(SEQ ID NO: 532) 
5'-P03-GTC CTG AAC 
(SEQ ID NO: 534) 
5'-P03-CGC CTTTAC 
(SEQ ID NO: 536) 
5'-P03-CGAGGAGAC 
(SEQ ID NO: 538) 
5'-P03-GCA TCTGAC 
(SEQ ID NO: 540) 
5'-P03-TTG CTG CAC 
(SEQ ID NO: 542) 
5'-P03-ACT CCA CAC 
(SEQ ID NO: 544) 
5'-P03-CTA CTG GAC 
(SEQ ID NO: 546) 
5'-P03-GTG CCA TAC 
(SEQ ID NO: 548) 
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5 ' -P03-GGA CTG TGA 5 ' -P03 

3.23 (SEQ ID NO: 549) (SEQ 
5'-P03-CCG AACTGA 5'-P03 

3.24 (SEQ ID NO: 551) (SEQ 



-ACA GTC CAC 
ID NO:550) 

-AGT TCG GAC 
ID NO: 552) 



5'-P03-CTC CTC AGA 5 '-P03 

3.25 (SEQ ID NO: 553) (SEQ 
5'-P03-CACTGCTGA 5'-P03 

3.26 (SEQ ID NO: 555) (SEQ 
5'-P03-AGC AGGCGA 5'-P03 

3.27 (SEQ ID NO: 557) (SEQ 
5'-P03-AGC AGG AGA 5 '-P03 

3.28 (SEQ ID NO: 559) (SEQ 
5 '-P03-AGA GCC AGA 5 '-P03 

3.29 (SEQ ID NO: 561) (SEQ 
5'-P03-GTC GTT GGA 5'-P03 

3.30 (SEQ ID NO: 563) (SEQ 
5 '-P03-CCG AAC GGA 5 '-P03 

3.31 (SEQ ID NO: 565) (SEQ 
5'-P03-CAC TGC GGA 5'-P03 

3.32 (SEQ ID NO: 567) (SEQ 
5'-P03-GTG GAG CGA 5'-P03 

3.33 (SEQ ID NO: 569) (SEQ 
5'-P03-GTG GAG AGA 5'-P03 

3.34 (SEQ ID NO:571) . (SEQ 
5'-P03-GGACTGCGA 5'-P03 

3.35 (SEQ ID NO: 573) (SEQ 
5'-P03-CCG AAC CGA 5'-P03 

3.36 (SEQ ID NO: 575) (SEQ 



-TGA GGA GAC 

ID NO: 554) 
-AGC AGT GAC 

ID NO:556) 
-GCC TGC TAC 

ID NO:558) 
-TCC TGC TAC 

ID NO:560) 
-TGG CTC TAC 

ID NO: 5 62) 
-CAA CGA CAC 

ID NO: 564) 
-CGT TCG GAC 

ID NO: 566) 
-CGC AGT GAC 

ID NO:568) 
-GCT CCA CAC 

ID NO:570) 
-TCT CCA CAC 

ID NO: 572) 
-GCA GTC CAC 

ID NO: 574) 
-GGT TCG GAC 

ID NO:576) 



5'-P03-CAC TGC CGA 5 '-P03 

3.37 (SEQ ID NO: 57 7) (SEQ 
5'-P03-CGA AAC GGA 5 '-P03 

3.38 (SEQ ID NO: 579) (SEQ 
5 ' -P03-GGA CTG AGA 5 ' -P03 

3.39 (SEQ ID NO: 581) (SEQ 
5 ' -P03-CCG AAC AGA 5 ' -P03 

3.40 (SEQ ID NO: 583) (SEQ 
5'-P03-CGA AAC CGA 5'-P03 

3.41 (SEQ ID NO: 585) (SEQ 
5'-P03-CTGGCTTGA 5'-P03 

3.42 (SEQ ID NO: 587) (SEQ 
5'-P03-CAC ACCTGA 5'-P03 

3.43 (SEQ ID NO: 589) (SEQ 
5'-P03-AAC GAC CGA 5'-P03 

3.44 (SEQ ID NO: 591) (SEQ 
5'-P03-ATCCAGCGA 5'-P03 

3.45 (SEQ ID NO: 593) (SEQ 
5'-P03-TGCGAAGGA 5'-P03 

3.46 (SEQ ID NO: 595) (SEQ 
5'-P03-TGCGAACGA 5'-P03 

3.47 (SEQ ID NO: 597) (SEQ 
5'-P03-CTGGCTGGA 5'-P03 

3.48 (SEQ ID NO: 599) (SEQ 



GGC AGT GAC 
ID NO: 578) 
CGT TTC GAC 
ID NO: 58 0) 
TCA GTC CAC 
ID NO: 582) 
TGT TCG GAC 
ID NO: 584) 
GGT TTC GAC 
ID NO:586) 
AAG CCA GAC 
ID NO:588) 
AGG TGT GAC 
ID NO:590) 
GGT CGT TAC 
ID NO: 592) 
GCT GGA TAC 
ID NO: 594) 
CTT CGC AAC 
ID NO: 5 96) 
GTT CGC AAC 
ID NO:598) 
GAG CCA GAC 
ID NO:600) 
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5'-P03-CACACCGGA 5'-P03- 

3.49 (SEQ ID NO: 601) (SEQ 
5'-P03-AGTGCAGGA 5'-P03- 

3.50 (SEQ ID NO: 603) (SEQ 
5'-P03-GACCGTTGA 5'-P03 

3.51 (SEQ ID NO: 605) (SEQ 
5 '-P03-GGT GAG TGA 5 '-P03 

3.52 (SEQ ID NO: 607) (SEQ 
5 '-P03-CCT TCC TGA 5 '-P03 

3.53 (SEQ ID NO: 609) (SEQ 
5 '-P03-CTG GCT AGA 5 '-P03- 

3.54 (SEQ ID NO: 611) (SEQ 
5'-P03-CAC ACC AGA 5'-P03- 

3.55 (SEQ ID NO: 613) (SEQ 
5 '-P03-AGC GGT AGA 5 '-P03 

3.56 (SEQ ID NO: 615) (SEQ 
5 '-P03-GTC AGA GGA 5 '-P03- 

3.57 (SEQ ID NO: 617) (SEQ 
5 '-P03-TTC CGA CGA 5 '-P03 

3.58 (SEQ ID NO: 619) (SEQ 
5 '-P03-AGG CGT AGA 5 '-P03 

3.59 (SEQ ID NO: 621) (SEQ 
5 '-P03-CTC GAC TGA 5 '-P03 

3.60 (SEQ ID NO: 623) (SEQ 



CGG TGT GAC 
ID NO: 6 02) 
CTG CAC TAC 
ID NO: 6 04) 
AAC GGT CAC 
ID NO:606) 
ACT CAC CAC 
ID NO: 608) 
AGG AAG GAC 
ID NO: 610) 
TAG CCA GAC 
ID NO:612) 
TGG TGT GAC 
ID NO:614) 
TAC CGC TAC 
ID NO:616) 
CTC TGA CAC 
ID NO:618) 
GTC GGA AAC 
ID NO: 62 0) 
TAC GCC TAC 
ID NO: 622) 
AGT CGA GAC 
ID- NO : 624) . 



5'-P03-TAC GCT GGA ' 5'-P03 

3.61 (SEQ ID NO: 62 5) (SEQ 
5 '-P03-GTT CGG TGA 5 '-P03 

3.62 (SEQ ID NO: 62 7) (SEQ 
5 '-P03-GCC AGC AGA 5 '-P03 

3.63 (SEQ ID NO: 62 9) (SEQ 
5 '-P03-GAC CGT AGA 5 '-P03 

3.64 (SEQ ID NO: 631) (SEQ 
5 '-P03-GTG CTC TGA 5 '-P03 

3.65 (SEQ ID NO: 633) (SEQ 
5 '-P03-GGT GAG CGA 5 '-P03 

3.66 (SEQ ID NO: 635) (SEQ 
5 '-P03-GGT GAG AGA 5 '-P03 

3.67 (SEQ ID NO: 637) (SEQ 
5 '-P03-CCT TCC AGA 5 '-P03 

3.68 (SEQ ID NO: 639) (SEQ 
5 '-P03-CTC CTA CGA 5 '-P03 

3.69 (SEQ ID NO: 641) (SEQ 
5 '-P03-CTC GAC GGA 5 '-P03 

3.70 (SEQ ID NO: 643) (SEQ 
5 '-P03-GCC GTT TGA 5 '-P03 

3.71 (SEQ ID NO: 645) (SEQ 
5 '-P03-GCG GAG TGA 5 '-P03 

3.72 (SEQ ID NO: 64 7) (SEQ 



GAG CGT AAC 
ID NO: 626) 
ACCGAACAC 
ID NO: 628) 
TGC TGG CAC 
ID NO: 63 0) 
TAC GGT CAC 
ID NO: 632) 
AGA GCA CAC 
ID NO: 634) 
GCT CAC CAC 
ID NO:636) 
TCT CAC CAC 
ID NO:638) 
TGG AAG GAC 
ID NO: 640) 
GTA GGA GAC 
ID NO: 642) 
CGT CGA GAC 
ID NO: 644) 
-AAA CGG CAC 
ID NO: 64 6) 
ACT CCG CAC 
ID NO: 648) 



5 '-P03-CGT GCT TGA 5 '-P03 

3.73 (SEQ ID NO: 64 9) (SEQ 
5 '-P03-CTC GAC CGA 5 '-P03 

3.74 (SEQ ID NO: 651) (SEQ 



-AAG CAC GAC 
ID NO:650) 
GGT CGA GAC 
ID NO: 652) 
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5'-P03-AGAGCAGGA 5'-P03 

3.75 (SEQ ID NO: 653) (SEQ 
5 '-P03-GTG CTC GGA 5 '-P03 

3.76 (SEQ ID NO: 655) (SEQ 
5'-P03-CTC GAC AG A 5'-P03 

3.77 (SEQ ID NO: 657) (SEQ 
5 '-P03-GGA GAG TGA 5 '-P03 

3.78 (SEQ ID NO: 659) (SEQ 
5'-P03-AGGCTGTGA 5'-P03 

3.79 (SEQ ID NO: 661) (SEQ 
5 '-P03-AGA GCA CGA 5 '-P03 

3.80 (SEQ ID NO: 663) (SEQ 
5 '-P03-CCA TCC TGA 5 '-P03 

3.81 (SEQ ID NO: 665) (SEQ 
5'-P03-GTT CGG AG A 5'-P03 

3.82 (SEQ ID NO: 667) (SEQ 
5 '-P03-TGG TAG CGA 5 '-P03 

3.83 (SEQ ID NO: 669) (SEQ 
5'-P03-GTG CTC CGA 5'-P03 

3.84 (SEQ ID NO: 671) (SEQ 



■CTG CTC TAC 

ID NO: 654) 
-CGA GCA CAC 

ID NO:656) 
-TGT CGA GAC 

ID NO:658) 
-ACT CTC CAC 

ID NO:660) 
-ACA GCC TAC 

ID NO: 662) 
-GTG CTC TAC 

ID NO: 6 64) 
-AGG ATG GAC 

ID NO:666) 
-TCC GAA CAC 

ID NO: 668) 
-GCT ACC AAC 

ID NO:670) 
-GGA GCA CAC 

ID NO: 672) 



5 '-P03-GTG CTC AGA 5 '-P03 

3.85 (SEQ ID NO: 673) (SEQ 
5 '-P03-GCC GTT GGA 5 '-P03 

3.86 - (SEQ ID NO:675) (SEQ 

• 5'-P03-GAG TGCTGA 5'-P03 

3.87 :■ (SEQ . ID NO: 677) (SEQ 

5 '-P03-GCT CCT TGA 5 '-P03 

3.88 (SEQ ID NO: 679) (SEQ 
5 '-P03-CCG AAA GGA 5 '-P03 

3.89 (SEQ ID NO: 681) (SEQ 
5 '-P03-CAC TGA GGA 5 '-P03 

3.90 (SEQ ID NO: 683) (SEQ 
5'-P03-CGTGCTGGA 5'-P03 

3.91 (SEQ ID NO: 685) (SEQ 
5'-P03-CCG AAA CGA 5'-P03 

3.92 (SEQ ID NO: 687) (SEQ 
5'-P03-GCG GAG AGA 5'-P03 

3.93 (SEQ ID NO: 689) (SEQ 
5 '-P03-GCC GTT AGA 5 '-P03 

3.94 (SEQ ID NO: 691) (SEQ 
5'-P03-TCTCGTGGA 5'-P03 

3.95 (SEQ ID NO: 693) (SEQ 
5'-P03-CGT GCT AGA 5'-P03 

3.96 (SEQ ID NO: 695) (SEQ 



TGA GCA CAC 
ID NO: 674) 
CAA CGG CAC 
ID NO:676) 
AGC ACT CAC 
ID NO:678) 
AAG GAG CAC 
ID NO:680) 
CTT TCG GAC 
ID NO: 682) 
CTC AGT GAC 
ID NO: 684) 
CAG CAC GAC 
ID NO:686) 
■GTT TCG GAC 
ID NO: 688) 
TCT CCG CAC 
ID NO: 6 90) 
TAA CGG CAC 
ID NO: 692) 
•CAC GAG AAC 
ID NO:694) 
-TAG CAC GAC 
ID NO:696) 



Table 6. Oligonucleotide tags used in cycle 4 



Tag Bottom strand 

number Top strand sequence sequence 

5 ' -P03 -GCCTGTCTT 5'-P03-GAC AGG CTC 

4.1 (SEQ ID NO: 697) (SEQ ID NO: 698) 
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5'-P03-CTCCTGGTT 5'-P03-CCA GGA GTC 

4.2 (SEQ ID NO: 699) (SEQ ID NO: 700) 
5 ' -P03 - ACTCTGCTT 5 '-P03-GCA GAG TTC 

4.3 (SEQ ID NO:701) (SEQ ID NO:702) 
5 ' -P03 -C ATCGCCTT 5 ' -P03 -GGC GAT GTC 

4.4 (SEQ ID NO: 703) (SEQ ID NO: 704) 
5 ' -P03 -GCC ACTATT 5 ' -P03 -TAG TGG CTC 

4.5 (SEQ ID NO: 705) (SEQ ID NO: 706) 
5 ' -P03 -C AC ACGGTT 5 '-P03-CCG TGT GTC 

4.6 (SEQ ID NO:707) (SEQ ID NO:708) 
5'-P03-CAACGCCTT 5'-P03-GGC GTT GTC 

4.7 (SEQ ID NO:709) (SEQ ID NO:710) 
5 ' -P03 - ACTG AGGTT 5 '-P03-CCT CAG TTC 

4.8 (SEQ ID NO: 711) (SEQ ID NO: 712) 
5 '-P03-GTGCTGGTT 5 '-P03-CCA GCA CTC 

4.9 (SEQ ID NO: 713) (SEQ ID NO: 714) 
5 ' -P03 -C ATCG ACTT 5 '-P03-GTC GAT GTC 

4.10 (SEQ ID NO: 715) (SEQ ID NO: 716) 
5 '-POS-CCATCGGTT 5 '-P03-CCG ATG GTC 

4.11 (SEQ ID NO:717) (SEQ ID NO:718) 
5 '-P03-GCTGCACTT 5 '-P03-GTG CAG CTC 

4.12 (SEQ ID NO:719) (SEQ ID NO:720) 

5'-P03-ACAGAGGTT 5'-P03-CCT CTG TTC 

4.13 (SEQ ID NO: 721) (SEQ ID NO:722) 
5>-P03-AGTGCCGTT <) 5 '-P03-CGG CAG TTC 

4.14 (SEQ ID , NO: 723), : ' (SEQ ID NO: 724) 
5 '-P03-CGGACATTT 5 '-P03-ATG TCC GTC 

4.15 (SEQ ID NO:725) (SEQ ID NO:726) 
5 ' -P03 -GGTCTGGTT 5 '-P03-CCA GAC CTC 

4.16 (SEQ ID NO: 727) (SEQ ID NO: 728) 
5'-P03-GAGACGGTT 5'-P03-CCG TCT CTC 

4.17 (SEQ ID NO:729) (SEQ ID NO:730) 
5 ' -P03 -CTTTCCGTT 5 '-P03-CGG AAA GTC 

4.18 (SEQ ID NO:731) (SEQ ID NO:732) 
5 ' -P03 -C AG ATGGTT 5 '-P03-CCA TCT GTC 

4.19 (SEQ ID NO:733) (SEQ ID NO:734) 
5 ' -P03 -CGG AC ACTT 5 '-P03-GTG TCC GTC 

4.20 (SEQ ID NO:735) (SEQ ID NO:736) 
5 '-P03-ACTCTCGTT 5 '-P03-CGA GAG TTC 

4.21 (SEQ ID NO:737) (SEQ ID NO:738) 
5 '-P03-GCAGCACTT 5 '-P03-GTG CTG CTC 

4.22 (SEQ ID NO:739) (SEQ ID NO:740) 
5 ' -P03 - ACTCTCCTT 5 '-P03-GGA GAG TTC 

4.23 (SEQ ID NO: 741) (SEQ ID NO: 742) 
5 '-P03-ACCTTGGTT 5 '-P03-CCA AGG TTC 

4.24 (SEQ ID NO: 743) (SEQ ID NO: 744) 

5 ' -P03 - AG AGCCGTT 5 '-P03-CGG CTC TTC 

4.25 (SEQ ID NO: 745) (SEQ ID NO: 746) 
5 ' -P03 - ACCTTGCTT 5 '-P03-GCA AGG TTC 

4.26 (SEQ ID NO: 747) (SEQ ID NO: 748) 
5'-P03-AAGTCCGTT 5'-P03-CGG ACT TTC 

4.27 (SEQ ID NO:749) (SEQ ID NO:750) 



-66- 



5'-P03-GGACTGGTT 5'-P03 

4.28 (SEQ ID NO: 751) (SEQ 
5'-P03-GTCGTTCTT 5'-P03 

4.29 (SEQ ID NO: 753) (SEQ 
5'-P03-CAGCATCTT 5'-P03 

4.30 (SEQ ID NO: 755) (SEQ 
5'-P03-CTATCCGTT 5'-P03 

4.31 (SEQ ID NO: 757) (SEQ 
5'-P03-ACACTCGTT 5'-P03 

4.32 (SEQ ID NO: 759) (SEQ 
5'-P03-ATCCAGGTT 5'-P03 

4.33 (SEQ ID NO: 761) (SEQ 
5'-P03-GTTCCTGTT 5'-P03 

4.34 (SEQ ID NO: 763) (SEQ 
5'-P03-ACACTCCTT 5'-P03 

4.35 (SEQ ID NO: 765) (SEQ 
5'-P03-GTTCCTCTT 5'-P03 

4.36 (SEQ ID NO: 767) (SEQ 



CCA GTC CTC 
ID NO:752) 
GAA CGA CTC 
ID NO:754) 
GAT GCT GTC 
ID NO:756) 
CGG ATA GTC 
ID NO:758) 
CGA GTG TTC 
ID NO:760) 
CCT GGA TTC 
ID NO: 762) 
GAG GAA CTC 
ID NO: 764) 
GGA GTG TTC 
ID NO:766) 
GAG GAA CTC 
ID NO:768) 



5'-P03-CTGGCTCTT 5'-P03 

4.37 (SEQ ID NO: 769) (SEQ 
5'-P03-ACGGCATTT 5'-P03 

4.38 (SEQ ID NO: 771) (SEQ 
5 ' -P03 -GGTGAGGTT 5'-P03 

4.39 (SEQ ID NO:773) . (SEQ 

■ ; > 5'-P03-CCTTCCGTT 5'-P03 

4.40 (SEQ ID NO:775) (SEQ 
5'-P03-TACGCTCTT 5'-P03 

4.41 (SEQ . ID NO: 777) (SEQ 
5'-P03-ACGGCAGTT 5'-P03 

4.42 (SEQ ID NO: 77 9) (SEQ 
5'-P03-ACTGACGTT 5'-P03 

4.43 (SEQ ID NO: 781) (SEQ 
5'-P03-ACGGCACTT 5'-P03 

4.44 (SEQ ID NO: 783) (SEQ 
5'-P03-ACTGACCTT 5'-P03 

4.45 (SEQ ID NO: 785) (SEQ 
5'-P03-TTTGCGGTT 5'-P03 

4.46 (SEQ ID NO: 787) (SEQ 
5 ' -P03 -TGGTAGGTT 5'-P03 

4.47 (SEQ ID NO: 789) (SEQ 
5'-P03-GTTCGGCTT 5'-P03 

4.48 (SEQ ID NO: 791) (SEQ 



GAG CCA GTC 
ID NO:770) 
ATG CCG TTC 
ID NO: 772) 
CCT CAC CTC 
ID NO: 774) 
CGG AAG GTC 
ID NO:776) 
GAG CGT ATC 
ID NO:778) 

-CTG CCG TTC 
ID NO: 78 0 

-CGT CAG TTC 
ID NO: 782) 

-GTG CCG TTC 
ID NO: 784) 

-GGT CAG TTC 
ID NO:786) 
CCG CAA ATC 
ID NO:788) 
CCT ACC ATC 
ID NO: 790) 
GCC GAA CTC 
ID NO: 792) 



5'-P03-GCCGTTCTT 5'-P03 

4.49 (SEQ ID NO: 793) (SEQ 
5 ' -P03 -GGAGAGGTT 5'-P03 

4.50 (SEQ ID NO: 795) (SEQ 
5'-P03-CACTGACTT 5'-P03 

4.51 (SEQ ID NO: 797) (SEQ 
5'-P03-CGTGCTCTT 5'-P03 

4.52 (SEQ ID NO: 799) (SEQ 
5'-P03-AATCCGCTT 5'-P03 

4.53 (SEQ ID NO: 801) (SEQ 



-GAA CGG CTC 

ID NO: 794) 
-CCT CTC CTC 

ID NO:796) 
-GTC AGT GTC 

ID NO:798) 
-GAG CAC GTC 

ID NO:800) 
-GCGGATTTC 

ID NO: 802) 



-67- 



WO 2005/058479 



PCT/US2004/042964 



4.54 
4.55 
4.56 
4.57 
4.58 
4.59 
4.60 



5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 



AGGCTGGTT 
ID NO:803) 
GCTAGTGTT 
ID NO:805) 
GGAGAGCTT 
ID N0:807) 
GGAGAGATT 
ID NO: 809) 
AGGCTGCTT 
ID NO:811) 
GAGTGCGTT 
ID NO: 813) 
CCATCCATT 
ID NO:815) 



5'-P03-CCA GCC TTC 
(SEQ ID NO:804) 
5'-P03-CAC TAG CTC 
(SEQ ID NO: 806) 
5'-P03-GCTCTCCTC 
(SEQ ID NO:808) 
5'-P03-TCTCTCCTC 
(SEQ ID NO:810) 
5'-P03-GCAGCCTTC 
(SEQ ID NO: 812) 
5'-P03-CGC ACT CTC 
(SEQ ID NO: 814) 
5'-P03-TGG ATG GTC 
(SEQ ID NO: 816) 



4.61 
4.62 
4.63 
4.64 
4.65 
4.66 
4.67 
4.68 
4.69 
4.70 
4.71 
4.72 



5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03- 
(SEQ 
5'-P03- 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 
5'-P03 
(SEQ 



GCTAGTCTT 
ID NO: 817) 
AGGCTGATT 
ID NO:819) 
ACAGACGTT 
ID NO:821) 
GAGTGCCTT 
ID NO:823) 
ACAGACCTT 
ID NO -825). 
CGAGCTTTT. 
ID NO : 82.7) 
TTAGCGGTT 
ID NO:829) 
CCTCTTGTT 
ID NO: 831) 
GGTCTCTTT 
ID NO:833) 
GCCAGATTT 
ID NO: 835) 
GAGACCTTT 
ID NO:837) 
CACACAGTT 
ID NO:839) 



5'-P03-GAC TAG CTC 

(SEQ ID NO: 818) 
5'-P03-TCAGCCTTC 

(SEQ ID NO: 820) 
5'-P03-CGTCTGTTC 

(SEQ ID NO: 822) 
5'-P03-GGC ACT CTC 

(SEQ ID NO: 824) 
5'-P03-GGT CTG TTC 
. (SEQ ID..NO.:;82 6) 
! 5 -P03 - AAG CTG GTC 

(SEQ ID NO:828) : 
5'-P03-CCG CTA ATC 

(SEQ ID NO:830) 
5'-P03-CAA GAG GTC 

(SEQ ID NO: 832) 
5'-P03-AGAGACCTC 

(SEQ ID NO: 834) 
5'-P03-ATC TGG CTC 

(SEQ ID NO: 836) 
5'-P03-AGG TCT CTC 

(SEQ ID NO: 838) 
5'-P03-CTG TGT GTC 

(SEQ ID NO:840) 



5'-P03-CCTCTTCTT 5'-P03 

4.73 (SEQ ID NO: 841) (SEQ 
5'-P03-TAGAGCGTT 5'-P03 

4.74 (SEQ ID NO: 843) (SEQ 
5'-P03-GCACCTTTT 5'-P03 

4.75 (SEQ ID NO: 84 5) (SEQ 
5'-P03-GGCTTGTTT 5'-P03 

4.76 (SEQ ID NO: 84 7) (SEQ 
5'-P03-GACGCGATT 5'-P03 

4.77 (SEQ ID NO: 84 9) (SEQ 
5'-P03-CGAGCTGTT 5'-P03 

4.78 (SEQ ID NO: 851) (SEQ 
5'-P03-TAGAGCCTT 5'-P03 

4.79 (SEQ ID NO: 8 53) (SEQ 



GAA GAG GTC 
ID NO: 842) 
CGC TCT ATC 
ID NO: 844) 
AAG GTG CTC 
ID NO: 846) 
ACA AGC CTC 
ID NO: 84 8) 
TCG CGT CTC 
ID NO:850) 

-CAG CTC GTC 
ID NO: 852) 

-GGC TCT ATC 
ID NO: 854) 



-68- 



WO 2005/058479 



PCT/US2004/042964 



5 , -P03-CATCCGTTT 5'-P03-ACG GAT GTC 

4.80 (SEQ ID NO: 855) (SEQ ID NO: 856) 
5'-P03-GGTCTCGTT 5'-P03-CGA GAC CTC 

4.81 (SEQ ID NO:857) (SEQ ID NO:858) 
5'-P03-GCCAGAGTT 5'-P03-CTC TGG CTC 

4.82 (SEQ ID NO: 859) (SEQ ID NO: 860) 
5 ' -P03 -GAG ACCGTT 5 ' -P03-CGG TCT CTC 

4.83 (SEQ ID NO:861) (SEQ ID NO:862) 
5 ' -P03 -CGAGCTATT 5 ' -P03 -TAG CTC GTC 

4.84 (SEQ ID NO:863) (SEQ ID NO:864) 

5'-P03-GCAAGTGTT 5'-P03-CAC TTG CTC 

4.85 (SEQ ID NO: 865) (SEQ ID NO:866) 
5'-P03-GGTCTCCTT 5'-P03-GGA GAC CTC 

4.86 (SEQ ID NO:867) (SEQ ID NO:868) 
5'-P03-GCCAGACTT 5'-P03-GTC TGG CTC 

4.87 (SEQ ID NO: 869) (SEQ ID NO: 870) 
5'-P03-GGTCTCATT 5'-P03-TGA GAC CTC 

4.88 (SEQ ID NO:871) (SEQ ID NO:872) 
5'-P03-GAGACCATT 5'-P03-TGG TCT CTC 

4.89 (SEQ ID NO: 873) (SEQ ID NO:874) 
5'-P03-CCTTCAGTT 5'-P03-CTG AAG GTC 

4.90 (SEQ ID NO:875) (SEQ ID NO:876) 
5'-P03-GCACCTGTT 5'-P03-CAG GTG CTC 

4.91 (SEQ ID NO:;877) . ..(SEQ ID NO: 878) . 
5'-P03-AAAGGCGTT • 5 "-P03-CGC CTT TTC 

4.92 (SEQ ID NO: 879) (SEQ ID NO: 880) 
5'-P03-CAGATCGTT 5'-P03-CGA TCT GTC 

4.93 (SEQ ID NO:881) . (SEQ ID NO:882) 
5'-P03-CATAGGCTT 5'-P03-GCC TAT GTC 

4.94 (SEQ ID NO: 883) (SEQ ID NO: 884) 
5'-P03-CCTTCACTT 5'-P03-GTG AAG GTC 

4.95 (SEQ ID NO: 885) (SEQ ID NO: 886) 
5'-P03-GCACCTCTT 5'-P03-GAG GTG CTC 

4.96 (SEQ ID NO: 887) (SEQ ID NO: 888) 



Table 7: Correspondence between building blocks and oligonucleotide tags for Cycles 
1-4. 



Building 
block 


Cycle 1 


Cycle 2 


Cycle 3 


Cycle 4 


BB1 


1.1 


2.1 


3.1 


4.1 


BB2 


1.2 


2.2 


3.2 


4.2 


BB3 


1.3 


2.3 


3.3 


4.3 


BB4 


1.4 


2.4 


3.4 


4.4 


BB5 


1.5 


2.5 


3.5 


4.5 


BB6 


1.6 


2.6 


3.6 


4.6 


BB7 


1.7 


2.7 


3.7 


4.7 
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RD2 


1 8 


9 8 
Z.o 


D.o 


zl 8 


DRQ 


1 Q 

i .y 


9 0 
z.y 


1 Q 

D.y 


zl 0 


DDI A 
DD 1 W 


1 in 

1 . 1U 


9 1 n 

Z. 1 u 


i i n 
D. 1U 


zl 1 0 


DD 1 1 

-Djl> 1 1 


111 


9 11 
Z. 1 1 


111 


A 1 1 
4.11 


DDI/ 


1 ii 

1 . 1Z 


9 19 
Z. 1Z 


119 
J. 1Z 


A 1 9 
4. 1Z 


DD 1 1 

dd ID 


111 


9 11 
Z. 1 J 


111 
D. ID 


A 1 1 
4. ID 


DDI /I 
DD 1 4 


1 1 A 
1 . 14 


9 1/1 
Z. 14 


11/1 
J. 14 


A 1 A 
4.14 


DD 1 C 
DD 1 D 


1 1 < 


9 1^ 
Z. 1 J 


11^ 
D. 1 D 


A 1 S 
4. 1 D 


DD 1 
DD 1 O 


1 1 A 
1 . 1 0 


9 1 A 
Z. ID 


1 1 A 
J. ID 


A 1 A 
4. ID 


dd i n 

DD 1 / 


1 17 
1.1/ 


9 1 7 
Z. 1 / 


119 
D.l 1 


A 1 7 
4. 1 / 


DD 1 8 


1 1Q 


9 1 8 
Z. lo 


118 
J.15 


A 1 8 
4. lo 


DD 1 Q 

dd i y 


1 1 Q 

i . iy 


9 1 O 

z. iy 


1 1 Q 


A 1 0 

4. iy 


ddZU 


1 on 
1 .zu 


9 9H 

z.zu 


1 0(\ 
D.ZU 


A 9H 
4.ZU 


DDZ 1 


1 91 


9 91 
Z.Z 1 


1 91 
J.Zl 


A 91 
4.Z1 


RR99 
DD/Z 


1 

1 .zz 


9 99 
Z.ZZ 


1 99 
j.ZZ 


A 99 * . r 
4.ZZ 


DDO-1 

ddZj 


1 01 
1 .ZD 


9 91 
Z.Zj 


1 91 
j. ZD 


A 91 
4. ZD 


Djt>Z4 


1 9A 
1 ,Z4 


9 9/1 
Z.Z4 


1 9A 
D.Z4 


A 9A 
4.Z4 


PPO c 
ddZj 


1 9^ 
1 .ZD 


9 9S 
Z.ZJ 


1 9S 
D.ZD 


A 9S 
4. ZD 


ddZO 


1 

1 ,ZO 


9 9^ 
Z.ZO 


1 Ofx 

J.ZO 


A 9/^ 
4.ZO 


DDZ / 


1 97 
1 .Z / 


9 99 
Z.Z / 


1 97 
D.Z / 


A 97 * 
4.Z / 


DD O Q 

ddZo 


1 98 
1 .Zo 


9 98 
Z.Zo 


1 98 
j.Zo 


A 98 1 
4.ZO 


DD9Q 


1 90 

1 .Z7 


9 9Q 

Z.Z7 


1 90 


A 90 
4.Z7 


ddo n 


1 in 

1 JU 


9 in 

Z. DU 


1 in 


a in 

4.DU 


DDO 1 
DDJ 1 


1 n 


0 11 
Z.j 1 


1 11 


A 11 
4.D 1 


DD'}') 


1 19 
1 JZ 


9 19 
Z.jZ 


1 19 


A 19 
4.DZ 


DDJJ 


1 11 

1 .DD 


9 11 
Z.DD 


1 11 

D.DD 


A 11 
4.DD 


RR1/1 


1 1A 
1 . D4 


0 1A 
Z.D4 


1 1A 
D.D4 


A 1A 
4.D4 




1 IS 

L .DD 




1 IS 

J . D D 


A IS 


BB36 


1.36 


2.36 


3.36 


4.36 


BB37 


1.37 


2.37 


3.37 


4.37 


BB38 


1.38 


2.38 


3.38 


4.38 
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BB39 


1.39 


2.39 


3.39 


4.39 


BB40 


1.44 


2.44 


3.44 


4.44 


BB41 


1.41 


2.41 


3.41 


4.41 


BB42 


1.42 


2.42 


3.42 


4.42 


BB43 


1.43 


2.43 


3.43 


4.43 


BB44 


1.40 


2 40 


3.40 


4.40 


BB45 


1.45 


2.45 


3.45 


4.45 


BB46 


1.46 


2.46 


3.46 


4.46 


BB47 


1.47 


2.47 


3.47 


4.47 


BB48 


1.48 


2.48 


3.48 


4.48 


BB49 


1.49 


2 49 


3.49 


4.49 


BB50 


1.50 


2.50 


3.50 


4.50 


BB51 


1.51 


2.51 


3.51 


4.51 


BB52 


.1.52. 


2.52 


3.52 


4.52 


BB53 


1.53 ( 


2.53 


3.53 


4.53 


BB54 


1.54 


2.54 


3.54 


4.54 


BB55 


1.55 


2.55 


3.55 


4.55 


BB56 


1.56 


2.56 


3.56 


4.56 


BB57 


L57 


2.57 


3.57 


4.57 


BB58 


1.58 


2.58 


3.58 


4.58 


BB59 


1.59 


2.59 


3.59 


4.59 


BB60 


1.60 


2.60 


3.60 


4.60 


BB61 


1.61 


2.61 


3.61 


4.61 


BB62 


1.62 


2.62 


3.62 


4.62 


BB63 


1.63 


2.63 


3.63 


4.63 


BB64 


1.64 


2.64 


3.64 


4 64 


BB65 


1.65 


2.65 


3.65 


4.65 


BB66 


1.66 


2.66 


3.66 


4.66 


BB67 


1.67 


2.67 


3.67 


4.67 


BB68 


1.68 


2.68 


3.68 


4.68 


BB69 


1.69 


2.69 


3.69 


4.69 
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BB70 


1.70 


2.70 


3.70 


4.70 


BB71 


1.71 


2.71 


3.71 


4.71 


BB72 


1.72 


2.72 


3.72 


4.72 


BB73 


1.73 


2.73 


3.73 


4.73 


BB74 


1.74 


2.74 


3.74 


4.74 


BB75 


1.75 


2 75 


3.75 


4.75 


BB76 


1.76 


2.76 


3.76 


4.76 


BB77 


1.77 


2.77 


3.77 


4.77 


BB78 


1.78 


2.78 


3.78 


4.78 


BB79 


1.79 


2.79 


3.79 


4.79 


BB80 


1.80 


2 80 


3.80 


4.80 


BB81 


1.81 


2.81 


3.81 


4.81 


BB82 


L82 


2.82 


3.82 


4.82 


BB83 


1.96 


2.96 


3.96 


4.96 


BB84 


1.83 


2.83 


3.83 


4.83 


BB85 


1.84 


2.84 


3.84 


4.84 


BB86 


1.85 


2.85 


3.85 


4.85 


BB87 


1.86 


2.86 


3.86 


4.86 


BB88 


1.87 


2.87 


3.87 


4.87 


BB89 


1.88 


2.88 


3.88 


4.88 


BB90 


1.89 


2.89 


3.89 


4.89 


BB91 


1.90 


2.90 


3.90 


4.90 


BB92 


1.91 


2.91 


3.91 


4.91 


BB93 


1.92 


2.92 


3.92 


4.92 


BB94 


1.93 


2.93 


3.93 


4.93 


BB95 


1.94 


2.94 


3.94 


4.94 


BB96 


1.95 


2.95 


3.95 


4.95 



IX ligase buffer: 50 mM Tris, pH 7.5; 10 mM dithiothreitol; 10 mM MgCl 2 ; 2mM ATP; 
50 mM NaCl. 
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10X ligase buffer: 500 mM Tris, pH 7.5; 100 mM dithiothreitol; 100 mM MgCl 2 ; 20 
mM ATP; 500 mM NaCl 

Attachment of Water Soluble Spacer to Compound 2 

To a solution of Compound 2 (60 mL, 1 mM) in sodium borate buffer (150 
mM, pH 9.4) that was chilled to 4 °C was added 40 equivalents of N-Fmoc-15-amino- 
4,7,10,13-tetraoxaoctadecanoic acid (S-Ado) in N,N-dimethylformamide (DMF) (16 
mL, 0.15 M) followed by 40 equivalents of 4-(4,6-dimethoxy[1.3.5]triazin-2-yl)-4- 
methylmorpholinium chloride hydrate (DMTMM) in water (9.6 mL, 0.25 M). The 
mixture was gently shaken for 2 hours at 4 °C before an additional 40 equivalents of S- 
Ado and DMTMM were added and shaken for a further 16 hours at 4 °C. 

Following acylation, a 0.1X volume of 5 M aqueous NaCl and a 2.5X volume of 
cold (-20 °C) ethanol was added and the mixture was allowed to stand at -20 °C for at 
least one hour. The mixture was then centrifuged for 15 minutes at 14,000 rpm in a 4 °C 
centrifuge to give a white pellet which was washed with cold EtOH and then dried in a 
lyophilizer at room temperature for '30 minutes. The solid was dissolved in 40 mL of 
water and purified by Reverse Phase HPLC with a Waters Xterra RP i8 column. A 
binary mobile phase gradient profile was used to elute the product using a 50 mM 
aqueous triethylammonium acetate buffer at pH 7.5 and 99% acetontrile/1% water 
solution. The purified material was concentrated by lyophilization and the resulting 
residue was dissolved in 5 mL of water. A 0.1X volume of piperidine was added to the 
solution and the mixture was gently shaken for 45 minutes at room temperature. The 
product was then purified by ethanol precipitation as described above and isolated by 
centrifugation. The resulting pellet was washed twice with cold EtOH and dried by 
lyophilization to give purified Compound 3. 

Cycle 1 

To each well in a 96 well plate was added 12.5 jiL of a 4 mM solution of 
Compound 3 in water; 100 jaL of a 1 mM solution of one of oligonucleotide tags 1.1 to 
1.96, as shown in Table 3 (the molar ratio of Compound 3 to tags was 1 :2). The plates 
were heated to 95°C for 1 minute and then cooled to 16°C over 10 minutes. To each 
well was added 10 jaL of 10X ligase buffer, 30 units T4 DNA ligase (1 |xL of a 30 
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unit/|LiL solution (FermentasLife Science, Cat. No. EL0013)), 76.5 ^1 of water and the 
resulting solutions were incubated at 16 °C for 16 hours. 

After the ligation reaction, 20 |aL of 5 M aqueous NaCl was added directly to 
each well, followed by 500 )iL cold (-20 °C) ethanol, and held at -20 °C for 1 hour. The 
plates were centrifugated for 1 hour at 3200g in a Beckman Coulter Allegra 6R 
centrifuge using Beckman Microplus Carriers. The supernatant was carefully removed 
by inverting the plate and the pellet was washed with 70% aqueous cold ethanol at -20 
°C. Each of the pellets was then dissolved in sodium borate buffer (50 p,L, 150 mM, pH 
9.4) to a concentration of 1 mM and chilled to 4 °C. 

To each solution was added 40 equivalents of one of the 96 building block 
precursors in DMF (13 \iL, 0.15 M) followed by 40 equivalents of DMT-MM in water 
(8 juL, 0.25M), and the solutions were gently shaken at 4°C. After 2 hours, an additional 
40 equivalents of one of each building block precursor and DMTMM were added and 
the solutions were gently shaken for 16 hours at 4 °C. Following acylation, 10 
equivalents of acetic acid-N-hydrpxy-succinimide ester in DMF (2 jaL, 0.25M) was 
added to each solution and gently shaken for 10 minutes. 

Following acylation, the 96 reaction mixtures were pooled and 0.1 volume of 5M 
aqueous NaCl and 2.5 volumes of cold absolute ethanol were added and the solution was 
allowed to stand at -20 °C for at least one hour. The mixture was then centrifuged. 
Following centrifugation, as much supernatant as possible was removed with a 
micropipette, the pellet was washed with cold ethanol and centrifuged again. The 
supernatant was removed with a 200 fiL pipet. Cold 70% ethanol was added to the tube, 
and the resulting mixture was centrifuged for 5 min at 4°C. 

The supernatant was removed and the remaining ethanol was removed by 
lyophilization at room temperature for 10 minutes. The pellet was then dissolved in 2 
mL of water and purified by Reverse Phase HPLC with a Waters Xterra RPi 8 column. A 
binary mobile phase gradient profile was used to elute the library using a 50 mM 
aqueous triethylammonium acetate buffer at pH 7.5 and 99% acetontrile/1% water 
solution. The fractions containing the library were collected, pooled, and lyophilized. 
The resulting residue was dissolved in 2.5 mL of water and 250 jaL of piperidine was 
added. The solution was shaken gently for 45 minutes and then precipitated with 
ethanol as previously described. The resulting pellet was dried by lyophilization and 
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then dissolved in sodium borate buffer (4.8 mL, 150 mM, pH 9.4) to a concentration of 1 
mM. 

The solution was chilled to 4 °C and 40 equivalents each of N-Fmoc- 
propargylglycine in DMF (1 .2 mL, 0.15 M) and DMT-MM in water (7.7 mL, 0.25 M) 
were added. The mixture was gently shaken for 2 hours at 4 °C before an additional 40 
equivalents of N-Fmoc-propargylglycine and DMT-MM were added and the solution 
was shaken for a further 16 hours. The mixture was later purified by EtOH precipitation 
and Reverse Phase HPLC as described above and the N-Fmoc group was removed by 
treatment with piperidine as previously described. Upon final purification by EtOH 
precipitation, the resulting pellet was dried by lyophilization and carried into the next 
cycle of synthesis 

Cycles 2-4 

For each of these cycles, the dried pellet from the previous cycle was dissolved 
in water and the concentration of library was determined by spectrophotometry based on 
the extinction coefficient of the DNA component of the library, where the initial 
extinction coefficient of Compound 2 is 131,500 L/(mole.cm). The concentration of the 
library was adjusted with water such that the final concentration in the subsequent 
ligation reactions was 0.25 mM. The library was then divided into 96 equal aliquots in a 
96 well plate. To each well was added a solution comprising a different tag (molar ratio 
of the library to tag was 1 :2), and ligations were performed as described for Cycle 1 . 
Oligonucleotide tags used in Cycles 2, 3 aand 4 are set forth in Tables 4, 5 and 6, 
respectively. Correspondense between the tags and the building block precursors for 
each of Cycles 1 to 4 is provided in Table 7. The library was precipitated by the 
addition of ethanol as described above for Cycle 1, and dissolved in sodium borate 
buffer (150 mM, pH 9.4) to a concentration of 1 mM. Subsequent acylations and 
purifications were performed as described for Cycle 1, except HPLC purification was 
omitted during Cycle 3. 

The products of Cycle 4 were ligated with the closing primer shown below, using 
the method described above for ligation of tags. 

5'-P0 3 -CAG AAGACAGAC AAG CTT CAC CTG C (SEQ ID NO: 889) 
5'-P03-GCA GGT GAA GCT TGT CTG TCT TCT GAA (SEQ ID NO: 890) 
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Results: 



The synthetic procedure described above has the capability of producing a library 
comprising 96 4 (about 10 8 ) different structures. The synthesis of the library was 
monitored via gel electrophoresis and LC/MS of the product of each cycle. Upon 
completion, the library was analyzed using several techniques. Figure 13a is a 
chromatogram of the library following Cycle 4, but before ligation of the closing primer; 
Figure 13b is a mass spectrum of the library at the same synthetic stage. The average 
molecular weight was determined by negative ion LC/MS analysis. The ion signal was 
deconvoluted using ProMass software. This result is consistent with the predicted 
average mass of the library. 

The DNA component of the library was analyzed by agarose gel electrophoresis, 
which showed that the majority of library material corresponds to ligated product of the 
correct size. DNA sequence analysis of molecular clones of PCR product derived from - 
a sampling of the library shows that DNA ligation occurred with high fidelity and to 
near completion. 

Library cyclization 

At the completion of Cycle 4, a portion of the library was capped at the N- 
terminus using azidoacetic acid under the usual acylation conditions. The product, after 
purification by EtOH precipitation, was dissolved in sodium phosphate buffer (150 mM, 
pH 8) to a concentration of 1 mM and 4 equivalents each of CUSO4 in water (200 mM), 
ascorbic acid in water (200 mM), and a solution of the compound shown below in DMF 
(200 mM) were added. The reaction mixture was then gently shaken for 2 hours at room 
temperature. 




Ph 
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To assay the extent of cyclization, 5 \\L aliquots from the library cyclization 
reaction were removed and treated with a fluorescently-labeled azide or alkyne (1|J,L of 
100 mM DMF stocks) prepared as described in Example 4. .After 16 hours, neither the 
alkyne or azide labels had been incorporated into the library by HPLC analysis at 500 
nm. This result indicated that the library no longer contained azide or alkyne groups 
capable of cycloaddition and that the library must therefore have reacted with itself, 
either through cyclization or intermolecular reactions. The cyclized library was purified 
by Reverse Phase HPLC as previously described. Control experiments using uncyclized 
library showed complete incorporation of the fluorescent tags mentioned above. 

Example 4: Preparation of Fluorescent Tags for Cyclization Assay: 

In separate tubes, propargyl glycine or 2-amino-3-phenylpropylazide (8 \imo\ 
each) was combined with FAM-OSu (Molecular Probes Inc.) (1.2 equiv.) in pH 9.4 
borate buffer (250 jjL). The reactions were allowed to proceed for 3 h at room 
temperature, and were then lyophilized overnight. Purification by HPLC afforded the 
desired fluorescent alkyne and azide in quantitative yield. 




FAM-OSu 





Fluorescent azide 
labeling agent 



Fluorescent alkyne 
labeling agent 
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Example 5: Cyclization of individual compounds using the azide/alkyne cycloaddition 
reaction 

Preparation of Azidoacetyl-Gly-Pro-Phe-Pra-NH 2 : 

Using 0.3 mmol of Rink-amide resin, the indicated sequence was synthesized 
using standard solid phase synthesis techniques with Fmoc-protected amino acids and 
HATU as activating agent (Pra = C-propargylglycine). Azidoacetic acid was used to cap 
the tetrapeptide. The peptide was cleaved from the resin with 20% TFA/DCM for 4 h. 
Purification by RP HPLC afforded product as a white solid (75 mg, 51%). ] H NMR 
(DMSO-d 6 , 400 MHz): 8.4 - 7.8 (m, 3H), 7.4 - 7. 1 (m, 7 H), 4.6 - 4.4 (m, 1H), 4.4 - 4.2 
(m, 2H), 4.0 - 3.9 (m, 2H), 3.74 (dd, 1H, J = 6 Hz, 17 Hz), 3.5 - 3.3 (m, 2H), 3.07 (dt, 
1H, J = 5 Hz, 14 Hz), 2.92 (dd, 1H, J = 5 Hz, 16 Hz), 2.86 (t, 1H, J = 2 Hz), 2.85 - 2.75 
(m, 1H), 2.6 - 2.4 (m, 2H), 2.2 - 1.6 (m, 4H). IR (mull) 2900, 2100, 1450, 1300 cm" 1 . 
ESIMS 497.4 ([M+H], 100%), 993.4 ([2M+H], 50%). ESIMS with ion-source 
fragmentation: 519.3 ([M+Na], 100%), 491.3 (100%), 480.1 ([M-NH 2 ], 90%), 452.2 
([M-NH 2 -CO], 20%), 424.2 (20%), 385.1 ([M-Pra], 50%), 357.1 ([M-Pra-CO], 40%), . 
238.0 ([M-Pra-Phe], 100%). 

Cyclization of Azidoacetyl-Gly-Pro-Phe-Pra-NH2: 



The azidoacetyl peptide (31 mg, 0.62 mmol) was dissolved in MeCN (30 mL). 
Diisopropylethylamine (DIE A, 1 mL) and Cu(MeCN)4PF 6 (1 mg) were added. After 
stirring for 1.5 h, the solution was evaporated and the resulting residue was taken up in 
20% MeCN/H 2 0. After centrifugation to remove insoluble salts, the solution was 
subjected to preparative reverse phase HPLC. The desired cyclic peptide was isolated as 
a white solid (10 mg, 32%). ] H NMR (DMSO-d 6 , 400 MHz): 8.28 (t, 1H, J = 5 Hz), 7.77 
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(s, 1H), 7.2 - 6.9 (m, 9H), 4.98 (m, 2H), 4.48 (m, 1H), 4.28 (m, 1H), 4.1 - 3.9 (m, 2H), 
3.63 (dd, 1H, J = 5 Hz, 16 Hz), 3.33 (m, 2H), 3.0 (m, 3H), 2.48 (dd, 1H, J = 1 1 Hz, 14 
Hz), 1.75 (m, 1H0, 1.55 (m, 1H), 1.32 (m, 1H), 1.05 (m, 1H). IR (mull) 2900, 1475, 
1400 cm" 1 . ESIMS 497.2 ([M+H], 100%), 993.2 ([2M+H], 30%), 1015.2 ([2M+Na], 
15%). ESIMS with ion-source fragmentation: 535.2 (70%), 519.3 ([M+Na], 100%), 
497.2 ([M+H], 80%), 480.1 ([M-NH 2 ], 30%), 452.2 ([M-NH 2 -CO], 40%), 208.1 (60%). 

Preparation of Azidoacetyl-Gly-Pro-Phe-Pra-Gly-OH: 

Using 0.3 ramol of Glycine- Wang resin, the indicated sequence was synthesized 
using Fmoc-protected amino acids and HATU as the activating agent. Azidoacetic acid 
was used in the last coupling step to cap the pentapeptide. Cleavage of the peptide was 
achieved using 50% TFA/DCM for 2 h. Purification by RP HPLC afforded the peptide 
as a white solid (83 mg; 50%). 'H NMR (DMSO-d 6 , 400 MHz): 8.4 - 7.9 (m, 4H), 7.2 
(m, 5H), 4.7 - 4.2 (m, 3H), 4.0 - 3.7 (m, 4H), 3.5 - 3.3 (m, 2H), 3.1 (m, 1H), 2.91 (dd, 
1H, J = 4 Hz, 16 Hz), 2.84 (t, 1H, J = 2.5 Hz), 2.78 (m, 1H), 2.6 - 2.4 (m, 2H), 2.2 - 1.6 
:- (m, 4H), IR.(mull) 2900, 2100, 1450, 1350 cm". 1 . ESIMS 555.3 ([M+H], 100%). ESIMS 
with ion-source fragmentation: 577.1 ([M+Na], 90%), 555.3 ([M+H], 80%), 480.1 ([M- 
Gly], 100%), 385.1 ([M-Gly-Pra], 70%), 357.1 ([M-Gly-Pra-CO], 40%), 238.0 ([M-Gly- 
Pra-Phe], 80%). 

Cyclization of Azidoacetyl-Gly-Pro-Phe-Pra-Gly-OH: 

The peptide (32 mg, 0.058 mmol) was dissolved in MeCN (60 mL). 
Diisopropylethylamine (1 mL) and Cu(MeCN) 4 PF 6 (1 mg) were added and the solution 
was stirred for 2 h. The solvent was evaporated and the crude product was subjected to 
RP HPLC to remove dimers and trimers. The cyclic monomer was isolated as a colorless 
glass (6 mg, 20%). ESIMS 555.6 ([M+H], 100%), 1109.3 ([2M+H], 20%), 1131.2 
([2M+Na], 15%). 

ESIMS with ion source fragmentation: 555.3 ([M+H], 100%), 480.4 ([M-Gly], 30%), 
452.2 ([M-Gly-CO], 25%), 424.5 ([M-Gly-2CO], 10%, only possible in a cyclic 
structure). 

Conjugation of Linear Peptide to DNA: 
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Compound 2 (45 nmol) was dissolved in 45 [iL sodium borate buffer (pH 9.4; 
150 mM). At 4° C, linear peptide (18 |nL of a 100 mM stock in DMF; 180 nmol; 40 
equiv.) was added, followed by DMT-MM (3.6 ^iL of a 500 mM stock in water; 180 
nmol; 40 equiv.). After agitating for 2 h, LCMS showed complete reaction, and product 
was isolated by ethanol precipitation. ESIMS 1823.0 ([M-3H]/3, 20%), 1367.2 ([M- 
4H]/4, 20%), 1093.7 ([M-5H]/5, 40%), 911.4 ([M-6HJ/6, 100%). 

Conjugation of Cyclic Peptide to DNA: 

Compound 2 (20 nmol) was dissolved in 20 jiL sodium borate buffer (pH 9.4, 
150 mM). At 4° C, linear peptide (8 jiL of a 100 mM stock in DMF; 80 nmol; 40 equiv.) 
was added, followed by DMT-MM (1.6 p.L of a 500 mM stock in water; 80 nmol; 40 
equiv.). After agitating for 2 h, LCMS showed complete reaction, and product was 
isolated by ethanol precipitation. ESIMS 1823.0 ([M-3H]/3, 20%), 1367.2 ([M-4H]/4, 
20%), 1093.7 ([M-5H]/5, 40%), 911.4 ([M-6H]/6, 100%). 

Cyclization of DNA-Linked Peptide: 

Linear peptide-DNA conjugate (10 nmol) was dissolved in pH 8 sodium 
phosphate buffer (10 |iL, 150mm). At room temperature, 4 equivalents each of Q1SO4, 
ascorbic acid, and the Sharpless ligand were all added (0.2 |iL of 200 mM stocks). The 
reaction was allowed to proceed overnight. RP HPLC showed that no linear peptide- 
DNA was present, and that the product co-eluted with authentic cyclic peptide-DNA. No 
traces of dimers or other oligomers were observed. 
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o r-i o J H o 

/ — NH O Ph 



4 equiv. each CuS0 4 , 
ascorbic acid, ligand 



pH 8 phosphate 
ligand = 



4 



v 



elutes (3) 4.48 min. 



elutes (S) 4.27 min. 



LC conditions: Targa C18, 2.1 x 40 mm, 10-40% 
MeCN in 40mM aq. TEAA over 8 min. 



Example 6: Application of Aromatic Nucleophilc Substitution Reactions to 
Functional Moiety Synthesis 

General Procedure for Arylation of Compound 3 with Cyanuric Chloride: 

Compound 2 is dissolved in pH 9.4 sodium borate buffer at a concentration of 1 
mM. The solution is cooled to 4° C and 20 equivalents of cyanuric chloride is then 
added as a 500 mM solution in MeCN. After 2h, complete reaction is confirmed by 
LCMS and the resulting dichlorotriazine-DNA conjugate is isolated by ethanol 
precipitation. 

Procedure for Amine Substitution of Dichlorotriazine-DNA: 

The dichlorotriazine-DNA conjugate is dissolved in pH 9.5 borate buffer at a 
concentration of 1 mM. At room temperature, 40 equivalents of an aliphatic amine is 
added as a DMF solution. The reaction is followed by LCMS and is usually complete 
after 2 h. The resulting alkylamino-monochlorotriazine-DNA conjugate is isolated by 
ethanol precipitation. 
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Procedure for Amine Substitution of Monochlorotriazine-DNA: 

The alkylamino-monochlorotriazine-DNA conjugate is dissolved in pH 9.5 
borate buffer at a concentration of 1 mM. At 42° C, 40 equivalents of a second aliphatic 
amine is added as a DMF solution. The reaction is followed by LCMS and is usually 
complete after 2 h. The resulting diaminotriazine-DNA conjugate is isolated by ethanol 
precipitation. 

Example 7: Application of Reductive Amination Reactions to Functional Moiety 
Synthesis 

General Procedure for Reductive Amination of DNA-Linker Containing a Secondary 
Amine with an Aldehyde Building Block: 

Compound 2 was coupled to an N-terminal proline residue. The resulting 
compound was dissolved in sodium phosphate buffer (50 jaL, 150 mM, pH 5.5) at a 
concentration of 1 mM. To this solution was added 40 equivalents each of an aldehyde 
building block in DMF (8 |aL, 0.25M) and sodium cyanoborohydride in DMF (8 |aL, 
0.25M) and the solution was heated at 80 °C for 2 hours. Following alkylation, the 
solution was purified by ethanol precipitation. 

General Procedure for Reductive Animations of DNA-Linker Containing an Aldehyde 
with Amine Building Blocks: 

Compound 2 coupled to a building block comprising an aldehyde group was 
dissolved in sodium phosphate buffer (50 jaL, 250 mM, pH 5.5) at a concentration of 1 
mM. To this solution was added 40 equivalents each of an amine building block in DMF 
(8 jiL, 0.25M) and sodium cyanoborohydride in DMF (8 jxL, 0.25M) and the solution 
was heated at 80 °C for 2 hours. Following alkylation, the solution was purified by 
ethanol precipitation. 
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Example 8: Application of Peptoid Building Reactions to Functional Moiety 
Synthesis 

General Procedure for Peptoid Synthesis on DNA-Linker: 




DNA-Linker 



40 eqivalents 40 eqivalents 



Compound 2 was dissolved in sodium borate buffer (50 |iL, 150 mM, pH 9.4) at 
a concentration of 1 mM and chilled to 4 °C. To this solution was added 40 equivalents 
of N-hydroxysuccinimidyl bromoacetate in DMF (13 jaL, 0.15 M) and the solution was 
gently shaken at 4 °C for 2 hours. Following acylation, the DNA-Linker was purified by 
ethanol precipitation and redissolved in sodium borate buffer (50 juL, 150 mM, pH 9.4) 
at a concentration of 1 mM and chilled to 4 °C. To this solution was added 40 eqivalents 
of an amine building block in DMF (13 |iL, 0.15 M) and the solution was gently shaken 
at 4 °C for 16 hours. Following alkylation, the DNA-linker was purified by ethanol 
precipitation and redissolved in sodium borate buffer (50 \xL, 150 mM, pH 9.4) at a 
concentration of 1 mM and chilled to 4 °C. Peptoid synthesis is continued by the 
stepwise addition of N-hydroxysuccinimidyl bromoacetate followed by the addition of 
an amine building block. 

Example 9: Application of the Azide-Alkyne Cycloaddition Reaction to Functional 
Moiety Synthesis 

General procedure 

An alkyne-containing DNA conjugate is dissolved in pH 8.0 phosphate buffer at 
a concentration of ca. ImM. To this mixture is added 10 equivalents of an organic azide 
and 5 equivalents each of copper (II) sulfate, ascorbic acid, and the ligand (tris-((l- 
benzyltriazol-4-yl)methyl)amine all at room temperature. The reaction is followed by 
LCMS, and is usually complete after 1 - 2 h. The resulting triazole-DNA conjugate can 
be isolated by ethanol precipitation. 
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Example 10 Identification of a ligand to Abl kinase from within an encoded library 

The ability to enrich molecules of interest in a DNA-encoded library above 
undesirable library members is paramount to identifying single compounds with defined 
properties against therapeutic targets of interest. To demonstrate this enrichment ability 
a known binding molecule (described by Shah et aL, Science 305, 399-401 (2004), 
incorporated herein by reference) to rhAbl kinase (GenBank U07563) was synthesized. 
This compound was attached to a double stranded DNA oligonucleotide via the linker 
described in the preceding examples using standard chemistry methods to produce a 
molecule similar (functional moiety linked to an oligonucleotide) to those produced via 
the methods described in Examples 1 and 2. A library generally produced as described 
in Example 2 and the DNA-linked Abl kinase binder were designed with unique DNA 
sequences that allowed qPCR analysis of both species. The DNA-linked Abl kinase 
binder was mixed with the library at a ratio of 1 : 1000. This mixture was equilibrated 
with to rhAble kinase, and the enzyme was captured on a solid phase, washed to remove 
non-binding library members and binding molecules were eluted. The ratio of library 
molecules to the DNA-linked Abl kinase inhibitor in the eluate was 1:1, indicating a 
greater than 500-fold enrichment of the DNA-linked Abl-kinase binder in a 1000-fold 
excess of library molecules. 

Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following 
claims. 
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Claims 

1 . A method of synthesizing a molecule comprising a functional moiety which is 
operatively linked to an encoding oligonucleotide, said method comprising the steps of: 

(a) providing an initiator compound consisting of an initial functional moiety 
comprising n building blocks, where n is an integer of 1 or greater, wherein the 
initial functional moiety comprises at least one reactive group, and is operatively 
linked to an initial oligonucleotide; 

(b) reacting the initiator compound with a building block comprising at least one 
complementary reactive group, wherein the at least one complementary reactive 
group is complementary to the reactive group of step (a), under conditions 
suitable for reaction of the complementary reactive group to form a covalent 
bond; 

. (c) reacting the initial oligonucleotide with an incoming oligonucleotide which 
identifies the building block of step (b) in the presence of an enzyme which 
catalyzes ligation of the initial oligonucleotide and the incoming oligonucleotide, 
under conditions suitable for ligation of the incoming oligonucleotide and the 
initial oligonucleotide to form an encoding oligonucleotide; 

thereby producing a molecule which comprises a functional moiety comprising 
n+1 building blocks which is operatively linked to an encoding oligonucleotide. 

2. The method of Claim 1 wherein the functional moiety of step (c) comprises a 
reactive group, and steps (a) to (c) are repeated one or more times, thereby forming 
cycles 1 to i, where i is an integer of 2 or greater, wherein the product of step (c) of a 
cycle s, where s is an integer of i-1 or less, is the initiator compound of cycle s + 1 . 

3. The method of Claim 1 wherein step (c) precedes step (b) or step (b) precedes 
step (c). 
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4. The method of any of Claim 1 wherein at least one of the building blocks is an 
amino acid or an activated amino acid. 

5. The method of Claim 1 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of an amino group; a carboxyl 
group; a sulfonyl group; a phosphonyl group; an epoxide group; an aziridine group; and 
an isocyanate group. 

6. The method of Claim 1 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of a hydroxyl group; a carboxyl 
group; a sulfonyl group; a phosphonyl group; an epoxide group; an aziridine group; and 
an isocyanate group. 

7. The method of Claim 1 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of an amino group and an aldehyde 
or ketone group. 

8. The method of claim 7 wherein the reaction between the reactive group and the 
complementary reactive group is conducted under reducing conditions. 

9. The method of Claim 1 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of a phosphorous ylide group and 
an aldehyde or ketone group. 

10. The method of Claim 1 wherein the reactive group and the complementary 
reactive group react via cycloaddition to form a cyclic structure. 

1 1 . The method of Claim 10 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of an alkyne and an azide. 

12. The method of Claim 10 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of a halogenated heteroaromatic 
group and a nucleophile. 
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13. The method of Claim 12 wherein the halogenated heteroaromatic group is 
selected from the group consisting of chlorinated pyrimidines, chlorinated triazines and 
chlorinated purines. 

14. The method of Claim 12 wherein the nucleophile is an amino group. 

15. The method of Claim 1, wherein the enzyme is selected from the group 
consisting of a DNA ligase, an RNA ligase, a DNA polymerase, an RNA polymerase 
and a topoisomerase. 

16. The method of Claim 1 wherein the initial oligonucleotide is double-stranded or 
single stranded. 

17. The method of Claim 16 wherein the initial oligonucleotide comprises a PCR 
primer sequence; . . 

18. The method of claim 16 wherein the initial oligonucleotide is single-stranded and 
the incoming oligonucleotide is single-stranded; or the initial oligonucleotide is double- 
stranded and the incoming oligonucleotide is double-stranded. 

19. The method of Claim 18 wherein the initial functional moiety and the initial 
oligonucleotide are linked by a linking moiety. 

20. The method of Claim 19 wherein the initial oligonucleotide is double-stranded 
and the linking moiety is covalently coupled to the initial functional moiety and to both 
strands of the initial oligonucleotide. 

21 . The method of Claim 1 wherein the incoming oligonucleotide is from 3 to 10 
nucleotides in length. 

22. The method of claim 2, wherein the incoming oligonucleotide of cycle i 
comprises a PCR closing primer. 
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23. The method of claim 2, further comprising in cycle i, the step of 

(d) ligating an oligonucleotide comprising a closing PCR primer sequence to the 
encoding oligonucleotide. 

24. The method of Claim 23 wherein the oligonucleotide comprising a closing PCR 
primer sequence is ligated to the encoding oligonucleotide in the presence of an enzyme 
which catalyzes said ligation. 

25. The method of Claim 2, further comprising after cycle i, the step of 

(e) cyclizing the functional moiety. 

26. The method of Claim 25 wherein the functional moiety comprises an alkynyl 
group and an azido group, and the compound is subjected to conditions suitable for 
cycloaddition of the alkynyl group and the azido group to form a triazole group, thereby 
cyclizing the functional moiety. 

27. A method of synthesizing a library of compounds, wherein the compounds 
comprise a functional moiety comprising two or more building blocks which is 
operatively linked to an initial oligonucleotide which identifies the structure of the 
functional moiety, said method comprising the steps of 

(a) providing a solution comprising m initiator compounds, wherein m is an 
integer of 1 or greater, where the initiator compounds consist of a functional moiety 
comprising n building blocks, where n is an integer of 1 or greater, which is operatively 
linked to an initial oligonucleotide which identifies the n building blocks; 

(b) dividing the solution of step (a) into r reaction vessels, wherein r is an integer 
of 2 or greater, thereby producing r aliquots of the solution; 

(c) reacting the initiator compounds in each reaction vessel with one of r building 
blocks, thereby producing r aliquots comprising compounds consisting of a functional 
moiety comprising n+1 building blocks operatively linked to the initial oligonucleotide; 
and 

(d) reacting the initial oligonucleotide in each aliquot with one of a set of r 
distinct incoming oligonucleotides in the presence of an enzyme which catalyzes the 
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ligation of the incoming oligonucleotide and the initial oligonucleotide, under conditions 
suitable for enzymatic ligation of the incoming oligonucleotide and the initial 
oligonucleotide; 

thereby producing r aliquots comprising molecules consisting of a functional 
moiety comprising n+1 building blocks operatively linked to an elongated 
oligonucleotide which encodes the n+1 building blocks. 

28. The method of Claim 27, further comprising the step of 

(e) combining two or more of the r aliquots, thereby producing a solution 
comprising molecules consisting of a functional moiety comprising n+1 building 
blocks, which is operatively linked to an elongated oligonucleotide which encodes the n 
+1 building blocks. 

29. The method of claim 28 wherein r aliquots are combined. 

30- The method of Claim 28 wherein the steps (a) to (e) are conducted one or more 
times to yield cycles 1 to i, where i is an integer of 2 or greater, wherein in cycle s+1, 
where s is an integer of i-1 or less, the solution comprising m initiator compounds of 
step (a) is the solution of step (e) of cycle s. 

3 1 . The method of either Claim 7 or Claim 8 wherein in at least one of cycles 1 to i 
step (d) precedes step (c). 

32.. The method of of Claim 28 wherein at least one of building blocks is an amino 
acid. 

33. The method of Claim 7 , wherein the enzyme is DNA ligase, RNA ligase, DNA 
polymerase, RNA polymerase or topoisomerase. 

34. The method of claim 28 wherein the initial oligonucleotide is a double- 
stranded oligonucleotide. 
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35. The method of Claim 34 wherein the incoming oligonucleotide is a double- 
stranded oligonucleotide. 



36 The method of Claim 28 wherein the initiator compounds comprise a linker 
moiety comprising a first functional group adapted to bond with a building block, a 
second functional group adapted to bond to the 5 'end of an oligonucleotide, and a third 
functional group adapted to bond to the 3 '-end of an oligonucleotide. 



37. 



The method of Claim 36 wherein the linker moiety is of the structure 
A 




E 

i 

B 

wherein 

A is a functional group adapted to bond to a building block; 

B is a functional group adapted to bond to the 5 '-end of an oligonucleotide; 

C is a functional group adapted to bond to the 3 '-end of an oligonucleotide; 

S is an atom or a scaffold; 

D is a chemical structure that connects A to S; 

E is a chemical structure that connects B to S; and 

F is a chemical structure that connects C to S. 



38. The method of Claim 37 wherein: 
A is an amino group; 
B is a phosphate group; and 
C is a phosphate group. 



39. The method of Claim 37 wherein D, E and F are each, independently, an 
alkylene group or an oligo(ethylene glycol) group. 
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40. The method of Claim 37 wherein S is a carbon atom, a nitrogen atom, a 
phosphorus atom, a boron atom, a phosphate group, a cyclic groupor a polycyclic group. 

41 . The method of claim 40 wherein the linker moiety is of the structure 



-OP(0) 2 0- (CH 2 CH 2 0) m — opo 3 - 

-N (CH 2 ) n - 

-OP(0) 2 Q- (CH 2 CH 2 0)p OPO3- 




wherein each of n, m and p is, independently, an integer from 1 to about 20. 

42. The method of Claim 41 wherein each of n, m and p is independently an integer 
from 2 to eight. 



43. The method of Claim 42 wherein each of n, m and p is independently an integer 
from 3 to 6. 

44. The method of Claim 41 wherein the linker moiety has the structure 




45. The method of claim 27, wherein each of said initiator compounds comprises a 
reactive group and wherein each of said r building blocks comprises a complementary 
reactive group which is complementary to said reactive group. 

46. The method of Claim 45 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of an amino group ; a carboxyl 
group; a sulfonyl group; a phosphonyl group; an epoxide group; an aziridine group; and 
an isocyanate group. 
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47. The method of Claim 45 wherein reactive group and the the complementary 
reactive group are selected from the group consisting of a hydroxyl group ; a carboxyl 
group; a sulfonyl group; aphosphonyl group; an epoxide group; an aziridine group; and 
an isocyanate group. 

48. The method of Claim 45 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of an amino group and an aldehyde 
or ketone group. 

49. The method of claim 45 wherein the reaction between the reactive group and the 
complementary reactive group is conducted under reducing conditions. 

50. The method of Claim 45 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of a phosphorous ylide group and 
an aldehyde or ketone group. 

5 1 . The method of Claim 45 wherein the reactive group and the complementary 
reactive group react via cycloaddition to form a cyclic structure. 

52. The method of Claim 51 wherein the reactive group and the complementary 
reactive group are selected from the group consisting of an alkyne and an azide. 

53. The method of Claim 45 wherein the reactive group and the complementary 
functional group are selected from the group consisting of a halogenated heteroaromatic 
group and a nucleophile. 

54. The method of Claim 53 wherein the halogenated heteroaromatic group is 
selected from the group consisting of chlorinated pyrimidines, chlorinated triazines and 
chlorinated purines. 

55. The method of Claim 53 wherein the nucleophile is an amino group. 
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The method of claim 28, further comprising following cycle i, the step of: 
(f) cyclizing one or more of the functional moieties. 



57. The method of claim 56 wherein a functional moiety of step (f) comprises an 
azido group and an alkynyl group. 

58. The method of Claim 57 wherein the functional moiety is maintained under 
conditions suitable for cycloaddition of the azido group and the alkynyl group to form a 
triazole group, thereby forming a cyclic functional moiety 

59. The method of claim 58 wherein the cycloaddition reaction is conducted in the 
presence of a copper catalyst. 

60. The method of Claim 59 wherein at least one of the one or more functional 
moieties of step (f) comprises at least two sulfhydryl groups, and said functional moiety 
is maintained under conditions suitable for reaction of the two sulfhydryl groups to form 
a disulfide group, thereby cyclicizing the functional moiety. 

61 . The method of Claim 27 wherein the initial oligonucleotide comprises a PCR 
primer sequence. 

62. The method of claim 28, wherein the incoming oligonucleotide of cycle i 
comprises a PCR closing primer. 

63. The method of claim 28, further comprising following cycle i, the step of 

(d) ligating an oligonucleotide comprising a closing PCR primer sequence to the 
encoding oligonucleotide. 

64. The method of Claim 63 wherein the oligonucleotide comprising a closing PCR 
primer sequence is ligated to the encoding oligonucleotide in the presence of an enzyme 
which catalyzes said ligation. 

65. A compound of the formula 
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E 
B 
Z 

wherein: 

X is a functional moiety comprising one or more building blocks; 

Z is an oligonucleotide attached at its 3' terminus to B; 

Y is an oligonucleotide which is attached at its 5' terminus to C; 

A is a functional group that forms a covalent bond with X; 

B is a functional group that forms a bond with the 3 '-end of Z; 

C is a functional group that forms a bond with the 5' -end of Y; 

D, F and E are each, independently, a bifunctional linking group; and 

S an atom or a molecular scaffold. 

66. The compound of claim 65 wherein D, E and F are each independently an alkylene 
chain or an oligo(ethylene glycol) chain, and 

67. The compound of Claim 65, wherein Y and Z are substantially complementary and 
are oriented in the compound so as to enable Watson-Crick base pairing and duplex 
formation under suitable conditions. 

68. The compound of Claim 65 wherein Y and Z are the same length or different 
lengths. 

69. The compound of Claim 68 wherein Y and Z are the same length. 

70. The compound of claim 65, wherein Y and Z are each 10 or more bases in length 
and have complementary regions often or more base pairs. 

71 . The compound of Claim 65, wherein S is a carbon atom, a boron atom, a 
nitrogen atom, a phosphorus atom, or a polyatomic scaffold. 
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72. The compound of Claim 71 wherein S is a phosphate group or a cyclic group. 

73. The compound of Claim 72 wherein S is a cycloalkyl, cycloalkenyl, 
heterocycloalkyl, heterocycloalkenyl, aryl or heteroaryl group. 

74. The compound of claim 65 wherein the linker moiety is of the structure 

OP(0) 2 0- (CH 2 CH 2 0) m OPO 3 - 




-N (CH 2 ) n - 

-OP(0) 2 0- (CH 2 CH 2 0)p OPO 3 - 

wherein each of n, m and p is, independently, an integer from 1 to about 20. 



75. The compound of Claim 74 wherein each of n, m and p is independently an 
integer from 2 to eight. 

76. The compound of Claim 75 wherein each of n, m and p is independently an 
integer from 3 to 6. 



77. The compound of Claim 65 wherein the linker moiety has the structure 



HN 




78. The compound of Claim 65 wherein X and Y comprise a PCR primer sequence. 



79. A compound library comprising at least about 10 2 distinct compounds, said 
compounds comprising a functional moiety comprising two or more building blocks 
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which is operatively linked to an oligonucleotide which identifies the structure of the 
functional moiety. 

80. The compound library of Claim 79, said library comprising at least about 10 5 
copies of each of the distinct compounds. 

81. The compound library of claim 79, said library comprising at least about 10 6 
copies of each of the distinct compounds. 

82. The compound library of Claim 79 comprising at least about 10 4 distinct 
compounds. 

83. The compound library of Claim 79 comprising at least about 10 6 distinct 
compounds. 

84. The compound library of Claim 79 comprising at least about 10 8 distinct 
compounds. 

85. The compound library of Claim 79 comprising at least about 10 10 distinct 
compounds. 

86. The compound library of Claim 79 comprising at least about 10 12 distinct 
compounds. 

87. The compound library of claim 79 wherein said library comprises a multiplicity 
of compounds which are independently of Formula I: 




S 



E 



B 



Z 



(I) 



wherein: 
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X is a functional moiety comprising one or more building blocks; 

Z is an oligonucleotide attached at its 3' terminus to B; 

Y is an oligonucleotide which is attached at its 5' terminus to C; 

A is a functional group that forms a covalent bond with X; 

B is a functional group that forms a bond with the 3'-end of Z; 

C is a functional group that forms a bond with the 5 '-end of Y; 

D, F and E are each, independently, a bifunctional linking group; and 

S an atom or a molecular scaffold. 

88. The compound library of Claim 87 wherein A, B, C, D, E, F and S each have the 
same identity for each compound of Formula I. 

89. The compound library of Claim 87, said library consisting essentially of a 
multiplicity of compounds of Formula I. 

90. The compound library of claim 87 wherein D, E and F are each independently an 
alkylene chain or an oligo(ethylene glycol) chain. 

91. The compound library of Claim 87, wherein Y and Z are substantially 
complementary and are oriented in the compound so as to enable Watson-Crick base 
pairing and duplex formation under suitable conditions. 

92. The compound library of Claim 87 wherein Y and Z are the same length or 
different lengths. 

93. The compound library of Claim 87 wherein Y and Z are the same length. 

94. The compound library of claim 87, wherein Y and Z are each 10 or more bases in 
length and have complementary regions often or more base pairs. 

95. The compound library of Claim 87, wherein S is a carbon atom, a boron atom, a 
nitrogen atom, a phosphorus atom, or a polyatomic scaffold. 
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96. The compound library of Claim 87 wherein S is a phosphate group or a cyclic 
group. 

97. The compound library of Claim 96 wherein S is a cycloalkyl, cycloalkenyl, 
heterocycloalkyl, heterocycloalkenyl, aryl or heteroaryl group. 

98. The compound library of claim 87 wherein the linker moiety is of the structure 

OP(0) 2 0 (CH 2 CH 2 0) m OPO 3 - 




-N (CH 2 ) n 

-OP(0) 2 0- (CH 2 CH 2 0)p OP0 3 - 

wherein each of n, m and p is, independently, an integer from 1 to about 20. 



99. The compound library of Claim 98 wherein each of n, m and p is independently 
an integer from 2 to eight. 

100. The compound of Claim 99 wherein each of n, m and p is independently an 
integer from 3 to 6. 

101 . The compound of Claim 87 wherein the linker moiety has the structure 



HN 




102. The compound library of claim 87 wherein X and Z comprise a PCR primer 
sequence. 

103. A compound prepared by the method of Claim 1 . 

104. A compound library prepared by the method of Claim 27. 
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105. A method for identifying one or more compounds which bind to a 

biological target, said method comprising the steps of: 

(a) contacting the biological target with a compound library prepared by the 
method of Claim 27 under conditions suitable for at least one member of 
the compound library to bind to the target; 

(b) removing library members that do not bind to the target; 

(c) amplifying the encoding oligonucleotides of the at least one member of 
the compound library which binds to the target; 

(d) sequencing the encoding oligonucleotides of step (c); and 

(e) using the sequences determined in step (d) to determine the structure of 
the functional moieties of the members of the compound library which 
bind to the biological target; 

thereby identifying one or more compounds which bind to the biological 
target. 

106. A method for identifying a compound which binds to a biological target, said 
method comprising the steps of 

(a) contacting the biological target with a compound library comprising at 
least about 10 2 distinct compounds, said compounds comprising a 
functional moiety comprising two or more building blocks which is 
operatively linked to an oligonucleotide which identifies the structure of 
the functional moiety under conditions suitable for at least one member of 
the compound library to bind to the target; 

(b) removing library members that do not bind to the target; 

(c) amplifying the encoding oligonucleotides of the at least one member of 
the compound library which binds to the target; 

(d) sequencing the encoding oligonucleotides of step (c); and 

(e) using the sequences determined in step (d) to determine the structure of 
the functional moieties of the members of the compound library which 
bind to the biological target; 
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thereby identifying one or more compounds which bind to the biological 



target. 



107. The method of Claim 106 wherein the library comprises at least about 10 5 
copies of each of the distinct compounds. 

108. The method of claim 106 wherein the library comprises at least about 10 6 copies 
of each of the distinct compounds. 

109. The method of claim 106 wherein the library comprises at least about 10 4 distinct 
compounds. 

110. The method of Claim 106 wherein the library comprises at least about 10 6 
distinct compounds. 

111. The method of Claim 106 wherein the library comprises at least about 10 8 
distinct compounds. 

112. The method of Claim 106 wherein the library comprises at least about 10 10 
distinct compounds. 

113. The method of Claim 106 wherein the compound library comprises at least about 
10 12 distinct compounds. 

1 14. The method of claim 106 wherein the compound library comprises a multiplicity 
of compounds which are independently of Formula I: 




E 



B 



Z 



(I) 



wherein: 
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X is a functional moiety comprising one or more building blocks; 

Z is an oligonucleotide attached at its 3' terminus to B; 

Y is an oligonucleotide which is attached at its 5' terminus to C; 

A is a functional group that forms a covalent bond with X; 

B is a functional group that forms a bond with the 3 '-end of Z; 

C is a functional group that forms a bond with the 5 '-end of Y; 

D, F and E are each, independently, a bifunctional linking group; and 

S an atom or a molecular scaffold. 

115. The method of Claim 114 wherein A, B, C, D, E, F and S each have the same 
identity for each compound of Formula I. 

116. The method of Claim 1 14 wherein the compound library consists essentially of a 
multiplicity of compounds of Formula L 

.117. The method of claim 1 14 wherein D, E and F are each independently an alkylene 
chain or an oligo(ethylene glycol) chain. 

118. The method of Claim 1 14, wherein Y and Z are substantially complementary and 
are oriented in the compound so as to enable Watson-Crick base pairing and duplex 
formation under suitable conditions. 

119. The method of claim 114 wherein Y and Z are the same length or different 
lengths. 

120. The method of Claim 119 wherein Y and Z are the same length. 

121 . The method of claim 114, wherein Y and Z are each 10 or more bases in length 
and have complementary regions of ten or more base pairs. 

122. The method of Claim 114, wherein S is a carbon atom, a boron atom, a nitrogen 
atom, a phosphorus atom, or a polyatomic scaffold. 
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123. The method of Claim 114 wherein S is a phosphate group or a cyclic group. 

124. The method of Claim 123 wherein S is a cycloalkyl, cycloalkenyl, 
heterocycloalkyl, heterocycloalkenyl, aryl or heteroaryl group. 

125. The method of claim 114 wherein the linker moiety is of the structure 



-N (CH 2 ) n 




OP(0) 2 0- (CH 2 CH 2 0) m OPO 3 - 

OP(0) 2 0 (CH 2 CH 2 0)p OP0 3 - 



wherein each of n, m and p is, independently, an integer from 1 to about 20. 

126. The method of Claim 125 wherein each of n, m and p is independently an integer 
from 2 to eight. 

127 : The method of Claim 126 wherein each of n, m and p is independently an integer 
from 3 to 6. 



128. The method of Claim 127 wherein the linker moiety has the structure 



HN 




129. The method of claim 1 14 wherein X and Z comprise a PCR primer sequence. 
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Figure 2 



WO 2005/058479 



3/13 



PCT/US2004/042964 




WO 2005/058479 



4/13 



PCT/US2004/042964 




Figure 4 
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Figure 5 
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Figure 1 1 
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Figure 12 
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SEQUENCE LISTING 

<110> PRAECIS PHARMACEUTICALS, INC., et al . 

<120> METHODS FOR SYNTHESIS OF ENCODED 
LIBRARIES 

<130> PPI-156PC 

<150> 60/530854 
<151> 2003-12-17 

<150> 60/540681 
<151> 2004-01-30 

<150> 60/553715 
<151> 2004-03-15 

<150> 60/588672 
<151> 2004-07-16 

<160> 890 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 1 
gcaacgaag 

<210> 2 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 2 
tcgttgcca 

<210> 3 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 3 
gcgtacaag 

<210> 4 
<211> 9 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 4 

tgtacgcca 9 

<210> 5 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 6 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 6 

acagagcca 9 

<210> 7 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<210> 8 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 8 

atggcacca 9 

<210> 9 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 



<400> 5 
gctctgtag 



9 



<400> 7 
gtgccatag 



9 
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<400> 9 
gttgaccag 



9 



<210> 10 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 10 

ggtcaacca 9 

<210> 11 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 12 

acgctgaac 9 

<210> 13 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 14. 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> ; 14 

gactacgca 9 



<400> 11 
cgacttgac 



9 



<210> 12 



<400> 13 
cgtagtcag 



9 



<210> 15 
<211> 9 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 15 
ccagcatag 

<210> 16 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 16 
atgctggca 

<210> 17 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 17 
cctacagag 

<210> 18 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 18 
ctgtaggca 

<210> 19 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 19 
ctgaacgag 

<210> 20 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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9 



9 



9 



9 



4/162 



WO 2005/058479 



PCT/US2004/042964 



<400> 20 
acgacttgc 



9 



<210> 21 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 21 

ctccagtag 9 

<210> 22 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 23 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 23 

taggtccag 9 

<210> 24 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 25 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 25 

gcgtgttgt 9 



<400> 22 
actggagca 



9 



<400> 24 
ggacctaca 



9 



<210> 26 
<211> 9 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 26 

aacacgcct 9 

<210> 27 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 28 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 28 

tccaagcct 9 

<210> 29 

<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 29 

gtcaagcgt 9 

<210> 30 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 31 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<400> 27 
gcttggagt 



9 



<400> 30 
gcttgacct 



9 
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<400> 31 
caagagcgt 

<210> 32 
<211> 9 
<212> DNA. 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 32 
gctcttgct 

<210> 33 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 33 
cagttcggt 

<210> 34 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 34 
cgaactgct 

<210> 35 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 35 
cgaaggagt 

<210> 36 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<400> 36 
tccttcgct 

<210> 37 
<211> 9 
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<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 37 
cggtgttgt 



9 



<210> 38 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 38 

aacaccgct 9 

<210> 39 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 40 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 40 

agcaacgct 9 

<210> 41 

<211> 9 

<212> DNA 

<213> Artificial Sequence 



<210> 42 

<211> 9 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<400> 39 
cgttgctgt 



9 



<220> 

<223> synthetic construct 



<400> 41 
ccgatctgt 



9 
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<400> 42 
agatcggct 

<210> 43 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 43 
ccttctcgt 

<210> 44 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 44 
gagaaggct 

<210> 45 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 45 
tgagtccgt 

<210> 46 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 46 
ggactcact 

<210> 47 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 47 
tgctacggt 

<210> 48 
<211> 9 
<212> DNA 
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9 

9 

9 

9 
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<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 48 
cgttagact 

<210> 49 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 49 
gtgcgttga 

<210> 50 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 50 
aacgcacac 

<210> 51 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 51 
gttggcaga 

<210> 52 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 52 
tgccaacac 

<210> 53 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 53 
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9 



9 



9 



9 
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cctgtagga 



9 



<210> 54 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 54 

ctacaggac 9 

<210> 55 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 56 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 56 

tacgcagac 9 

<210> 57 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 58 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 58 

gcgtaagac 9 

<210> 59 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<400> 55 
ctgcgtaga 



9 



<400> 57 
cttacgcga 



9 
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<220> 

<223> synthetic construct 



<400> 59 
tggtcacga 



9 



<210> 60 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 60 

gtgaccaac 9 

<210> 61 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 62 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 
<400> 62 

gctctgaac 9 

<210> 63 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 64 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 64 



<400> 61 
tcagagcga 



9 



<400> 63 
ttgctcgga 



9 
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cgagcaaac 



9 



<210> 65 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 65 

gcagttgga 9 

<210> 66 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 67 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 67 

gcctgaaga 9 

<210> 68 
<211> 9 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 68 

ttcaggcac 9 

<210> 69 
<211> 9 
<212>.DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<400> 66 
caactgcac 



9 



<400> 69 
gtagccaga 



9 



<210> 70 
<211> 9 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 70 
tggctacac 



<210> 71 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 71 
gtcgcttga 

<210> 72 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 72 
aagcgacac 

<210> 73 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 73 
gcctaagtt 

<210> 74 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 74 
cttaggctc 

<210> 75 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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9 



9 



9 



9 
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<400> 75 
gtagtgctt 



9 



<210> 76 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 76 

gcactactc 9 

<210> 77 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 78 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 78 

cttcgactc 9 

<210> 79 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 80 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 80 

ccgaaactc 9 

<210> 81 
<211> 9 
<212> DNA 



<400> 77 
gtcgaagtt 



9 



<400> 79 
gtttcggtt 



9 
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<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 81 
cagcgtttt 

<210> 82 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 82 
aacgctgtc 

<210> 83 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 83 
catacgctt 

<210> 84 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 84 
gcgtatgtc 

<210> 85 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 85 
cgatctgtt 

<210> 86 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 86 
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9 



9 



9 



9 
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cagatcgtc 



9 



<210> 87 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 87 

cgctttgtt 9 

<210> 88 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 89 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 89 

ccacagttt 9 

<210> 90 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 91 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 91 

cctgaagtt 9 

<210> 92 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<400> 88 
caaagcgtc 



9 



<400> 90 
actgtggtc 



9 
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<220> 

<223> synthetic construct 



<400> 92 
cttcaggtc 



9 



<210> 93 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 93 

ctgacgatt 9 

<210> 94 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 95 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 95 

ctccacttt 9 

<210> 96 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 97 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 97 

accagagcc 9 



<400> 94 
tcgtcagtc 



9 



<400> 96 
agtggagtc 



9 
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<210> 98 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 98 
ctctggtaa 

<210> 99 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 99 
atccgcacc 

<210> 100 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 100 
tgcggataa 

<210> 101 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 101 
gacgacacc 

<210> 102 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 102 
tgtcgtcaa 

<210> 103 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct 

<400> 103 
ggatggacc 

<210> 104 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 104 
tccatccaa 

<210> 105 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 105 
gcagaagcc 

<210> 106 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 106 
cttctgcaa 

<210> 107 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 107 
gccatgtcc 

<210> 108 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 108 
acatggcaa 
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9 



9 



9 



9 



9 
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<210> 109 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 109 
gtctgctcc 

<210> 110 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 110 
agcagacaa 

<210> 111 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 111 
cgacagacc 

<210> 112 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 112 
tctgtcgaa 

<210> 113 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 113 
cgctactcc 

<210> 114 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 
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9 



9 



9 



9 
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<223> synthetic construct 

<400> 114 
agtagcgaa 

<210> 115 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 115 
ccacagacc 

<210> 116 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 116 
tctgtggaa 

<210> 117 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 117 
cctctctcc 

<210> 118 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 118 
agagaggaa 

<210> 119 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 119 
ctcgtagcc 

<210> 120 
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9 



9 



9 



9 



9 
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<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 120 
ctacgagaa 

<210> 121 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 121 

aaatcgatgt ggtcactcag 

<210> 122 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 122 

gagtgaccac atcgatttgg 

<210> 123 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 123 

aaatcgatgt ggactaggag 

<210> 124 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 124 

cctagtccac atcgatttgg 

<210> 125 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



PCT/US2004/042964 



9 



20 



20 



20 
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<400> 125 

aaatcgatgt gccgtatgag 



20 



<210> 126 
<211> 20 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 126 

catacggcac atcgatttgg 



20 



<210> 127 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 127 

aaatcgatgt gctgaaggag 2 0 

<210> 128 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 128 

ccttcagcac atcgatttgg 20 

<210> 129 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 129 

aaatcgatgt ggactagcag 20 

<210> 130 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 130 

gctagtccac atcgatttgg 20 

<210> 131 
<211> 20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 131 

aaatcgatgt gcgctaagag 2 0 

<210> 132 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 132 

cttagcgcac atcgatttgg 2 0 

<210> 133 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 133 

aaatcgatgt gagccgagag 20 

<210> 134 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 134 

ctcggctcac atcgatttgg 20 

<210> 135 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 135 

aaatcgatgt gccgtatcag 20 

<210> 136 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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<400> 136 

gatacggcac atcgatttgg 

<210> 137 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 137 

aaatcgatgt gctgaagcag 

<210> 138 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 138 

gcttcagcac atcgatttgg 

<210> 139 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 139 

aaatcgatgt gtgcgagtag 

<210> 140 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 140 

actcgcacac atcgatttgg 

<210> 141 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 141 

aaatcgatgt gtttggcgag 

<210> 142 
<211> 20 



PCT/US2004/042964 
20 

20 

20 

20 

20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 142 

cgccaaacac atcgatttgg 2 0 

<210> 143 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 143 

aaatcgatgt gcgctaacag 20 

<210> 144 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 144 

gttagcgcac atcgatttgg 2 0 

<210> 145 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 145 

aaatcgatgt gagccgacag 2 0 

<210> 146 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 146 

gtcggctcac atcgatttgg 2 0 

<210> 147 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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<400> 147 

aaatcgatgt gagccgaaag 

<210> 148 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 148 

ttcggctcac atcgatttgg 

<210> 149 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 149 

aaatcgatgt gtcggtagag 

<210> 150 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 150 

ctaccgacac atcgatttgg 

<210> 151 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> : 151 

aaatcgatgt ggttgccgag 

<210> 152 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 152 

cggcaaccac atcgatttgg 

<210> 153 
<211> 20 
<212> DNA 
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20 

20 

20 

20 

20 
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<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 153 

aaatcgatgt gagtgcgtag 

<210> 154 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 154 

acgcactcac atcgatttgg 

<210> 155 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 155 

aaatcgatgt ggttgccaag 

<210> 156 . 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 156 

tggcaaccac atcgatttgg 

<210> 157 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 157 

aaatcgatgt gtgcgaggag 

<210> 158 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 158 
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20 



20 



20 



20 
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cctcgcacac atcgatttgg 

<210> 159 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 159 

aaatcgatgt ggaacacgag 

<210> 160 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 160 

cgtgttccac atcgatttgg 

<210> 161 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 161 

aaatcgatgt gcttgtcgag 

<210> 162 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 162 

cgacaagcac atcgatttgg 

<210> 163 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 163 

aaatcgatgt gttccggtag 

<210> 164 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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20 



20 



20 



20 



20 
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<220> 

<223> synthetic construct 
<400> 164 

accggaacac atcgatttgg 

<210> 165 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 165 

aaatcgatgt gtgcgagcag 

<210> 166 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 166 

gctcgcacac atcgatttgg 

<210> 167 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 167 

aaatcgatgt ggtcaggtag 

<210> 168 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 168 

acctgaccac atcgatttgg 

<210> 169 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 169 

aaatcgatgt ggcctgttag 
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20 



20 



20 



20 



20 
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<210> 170 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 170 

aacaggccac atcgatttgg 

<210> 171 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 171 

aaatcgatgt ggaacaccag 

<210> 172 
; <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 172 

ggtgttccac atcgatttgg 

<210> 173 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 173 

aaatcgatgt gcttgtccag 

<210> 174 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 174 

ggacaagcac atcgatttgg 

<210> 175 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



20 



20 



20 



2 0 
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<220> 

<223> synthetic construct 
<400> 175 

aaatcgatgt gtgcgagaag 

<210> 176 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 176 

tctcgcacac atcgatttgg 

<210> 177 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 177 

aaatcgatgt gagtgcggag 

<210> 178 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 178 

ccgcactcac atcgatttgg 

<210> 179 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 179 

aaatcgatgt gttgtccgag 

<210> 180 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 180 

cggacaacac atcgatttgg 
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20 



20 



20 
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<210> 181 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 181 

aaatcgatgt gtggaacgag 

<210> 182 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 182 

cgttccacac atcgatttgg 

<210> 183 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 183 

aaatcgatgt gagtgcgaag 

<210> 184 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 184 

tcgcactcac atcgatttgg 

<210> 185 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 185 

aaatcgatgt gtggaaccag 

<210> 186 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 
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20 



20 



20 



20 
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<223> synthetic construct 
<400> 186 

ggttccacac atcgatttgg 

<210> 187 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 187 

aaatcgatgt gttaggcgag 

<210> 188 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 188 

cgcctaacac atcgatttgg 

<210> 189 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 189 

aaatcgatgt ggcctgtgag 

<210> 190 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 190 

cacaggccac atcgatttgg 

<210> 191 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 191 

aaatcgatgt gctcctgtag 
<210> 192 
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20 



20 



20 



20 



20 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 192 

acaggagcac atcgatttgg 2 0 

<210> 193 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 193 

aaatcgatgt ggtcaggcag 2 0 

<210> 194 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 194 

gcctgaccac atcgatttgg 2 0 

<210> 195 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 195 

aaatcgatgt ggtcaggaag 20 

<210> 196 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 196 

tcctgaccac atcgatttgg 2 0 

<210> 197 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
" <220> 
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<22 3> synthetic construct 
<400> 197 

aaatcgatgt ggtagccgag 2 0 

<210> 198 

<211> 20 . 

<212> DNA ! 

<213> Artificial Sequence 



<220> ; 
<223> synthetic 



construct 



<400> 198 

cggctaccac atcgatttgg 2 0 

<210> 199 
<211> 20 

<212> DNA "I''- 
<213> Artificial Sequence 

<220> . - 

<22 3> synthetic construct 
<4 ! 00> 199 

aaatcgatgt ggcctgtaag 20 

<210> 200 ; 
<211> 20 
<212> DNA 

<213> A.rtificial Seiquence 

<220> . 

<22 3 > synthetic construct 

<400> 200 

tacaggccac atcgatttgg 2 0 

<210> 201 

<211> 20 

<212> DNA ' i 

<2 ; 13> Artificial Sequence 

<2,20> ;• 

<223> synthetic cohistruct 
<400> 201 \ 

aaatcgatgt gctttcggag 20 

<210> 202 
<211> 20 
<212> DNA 1 . 

<213> Artificial Sequence 
<220> ' 

<2^23> synthetic construct 
<400> 202 ' ! 

ccgaaagcac atcgatttgg 20 
<210> 203 I 
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<211> 20 
<212> DNA \ 
<213> Artificial 



Sequence. 



<220> 

<22 3> synthetic construct 

' \ : ■ ' ' i ' 

<400> 203: 

aaatcgatgt gcgtaaggag 2 0 

<210> 204; •; 

<211> 20 

<212> DNA . "j .; 

<213> Artificial Sequence 

<220> i 

<22 3> synthetic construct 

<400> 204 . ! 

ccttacgcac jatcgatttgg ■, 2 0 

<210> 205 ; 

<211> 20 ; 

<212> DNA 

<213> Artificial 'Sequence' 

■ i , ■ ' • . ; 

<220> . :i 

<22 3> synthetic construct 

<400> 205 I : 

aaatcgatgt :gagagcgta;g 2 0 

<210> 206, :: ■ ;| 

<211> 20 I : M; ; : 

<212> DNA : 

<213> Artificial Sequence 

<220> -\ • 

<223> synthetic construct 

<400> 206; 

acgctctcac .-atcgatt'tigg. 20 

: T - " : : : .■ ■ ■! ! ■ 

<2io>. 207/ ; \ ; :|. !;■:■■ ; 
• <2ii> 20 " i: ■ ; =|" j! ' : - ; 

<212> DNA - J. ■ j s : j. J j 
<213> Artificial ;Sequerice 

■ '< ' ', \ - ■ ■ ■ 
<220> . ;j :i- ; , \ 

<223> synthetic : construct 
<400> 207 . : i! j ; V \ 

aaatcgatgt ggacggcaag 20 

<2io> 208 . ' H i; - : " : 

<211> 20 . ; j 
<212> DNA : '\ 



<213> Artifibial Sequence 



<220> ... | . 
<223> synthetic construct; 
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<400> 208 

tgccgtccac atcgatttgg 



20 



<210> 209 
<211> 20 
<212>, DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 209 

aaatcgatgt gctttcgcag 



20 



<210> 210 
<211> 20 
<212> DNA 

<2 13 > Artificial Sequence 
<220> 

<22 3> synthetic construct 

•i 

<400> 210 

gcgaaagcac atcgatttgg 20 

<210> 211 
<211> 2 0 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 211 

aaatcgatgt gcgtaagcag ,20 

<210> 212 j 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> ;212 

gcttacgcac atcgatttgg 20 

<210> 213 

<211> 20 

<212> DNA ; ' 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 213 

aaatcgatgt ggctatggag 



20 



<210> 214 
<211> 20 
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<212^ r DNA 

<213>> Artificial Sequence 
<220> 

<223^ synthetic construct 
<400>i 214 

ccatagccac atcgatttgg 

<210:> 215 
<211> 20 
<212>: DNA 

<213> Artificial Sequence 
<220> 

;<223> synthetic construct 
<400> 215 

aaatcgatgt gactctggag 

<210>i 216 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220i> 

<2 2 3:> synthetic construct 
<400> 216 

ccagaigtcac atcgatttgg 

<210> 217 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223>i synthetic construct 

i ! 

" " i : ■ ■ ! 

<400> 217 ■ j 

aaatcgatgt gctggaaag ' 

<21G* 218 \ 

k211^ 19 ■■ j 

<212> DNA j 

<213> Artificial Sequence 

* = '■ i i 

<220* j 
,<223> synthetic construct 

i l 
' ■ I 

<400> 218 

ttccagcaca tcgatttgg 

k210> 219 
<211> 20 
<212:i DNA . 

<213^> Artificial Sequence 

<220> • ! 

<223> synthetic construct 

' : , i ' • - ! 

: « I * I 

' * I I 



20 



20 



19 



19 



i 
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<400> 219 

aaatcgatgt gccgaagtag 

<210> 220 . 
<211> 20 
<212> DNA 
<213> Artificial Sequence 



:2:0 



<220> 



<223> synthetic 



construct 



<400> 220 
acttcggcac atcgatttgg 



20 



<210> 221 ; 
<211> 20 : : 
<212> DNA : 

<213> Artificial Sequence 



<220> ! ;. 
; <22 3 > synthetic ; construct 



<400> 221 : ; 

aaatcgatgt gctcctgaag 



20 



<210> 222 
<211> 20 . 
<212> DNA 

<213> Artificial Sequence 
:<220> . 

<223> synthetic i construct 



<400> 222: 

tcaggagcac atcgatttgg 



20 



<210> 223 
,<211> 20 
<212> DNA' 

;<213> Artificial Sequence 

<220> =• \ - 

<223> synthetic • construct 



<400> 223 . 

.aaatcgatgt gtccagtcag 



20 



<210> 224: 
<211> 20 :' 
<212> DNA' 

<213> Artificial Sequence 
;<220> ; ':: 

<223> synthetic construct 



<400> 224 

gactggaca;c atcgatttgg 

<210> 225 
<211> 20 ' 



20 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 225 

aaatcgatgt gagagcggag 

<210> 226 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 226 

ccgctctcac atcgatttgg 

<210> 227 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 227 

aaatcgatgt gagagcgaag 

<210> 228 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> synthetic construct 
<400> 228 

tcgctctcac atcgatttgg 

<210> 229 
<211> 2.0 

<212> DNA ! 
<213> Artificial Sequence 

<220> ; 

<223> synthetic construct 
<400> 229 

aaatcgatgt gccgaaggag 

<210> 230 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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20 



20 



20 



20 
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<400> 230; 

ccttcggcac atccjatttgg 

<210> 231 
<211> 20 , 
<212> DNA 

<213> Artificial Sequence 

<220> ; : i ■ :■ ; 

<223> synthetic construct 

<400> 231 ; 

aaatcgatgt gccgaagcag 

<210> 232 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 232 

gcttcggcac atcgatttgg 

<210> 233 

<211> 20 

<212> DNA: 

<213> Artificial Sequence 
<220> 

<:22 3> synthetip construct 

<400> 233^ 

aaatcgatgt gtgttccgag 

<210> 234 
<211> 20 

<212> DNA . 
<213> Artificial Sequence 

<220> : . l . ; 

<223> synthetic construct 

<400> 234i "; ! 
cggaacacab atcgatttgg 

<210> 235 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> : 

<22 3> synthetic construct 
<400> 235 

aaatcgatgt gtctggcgag 
; <210;> 236 

<211> 20 I '■ . ■ ; : 
<212> DNA ; 1 



PCT/US2004/042964 

20 

20 
:2 0 
20 
2 0 
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<213> Artificial Sequence 
<220> 

<2 23 > synthetic construct 
<400> 236 . 

cgccagacac atcgatttgg' 

<210> 237 I 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> ! 
<223> synthetic construct 

<400> 237 

aaatcgatgt gctatcggag 

<210> 238 
<211> 20 ; 

<212> DNA . ! 
<213> Artificial Sequence . 

<220> | 
<223> synthetic construct 

<400> 238 

ccgatagcac atcgatttgg 

<210> 239 
<211> 20 

<212> DNA • 

<213> Artificial Sequence 

i 

i 

<220> 

<223> synthetic construct 

<400> 239 I 
aaatcgatgt gcgaaaggag 

' ■ ! ■ i 

<2io> 240 ;: I 
<211> -20 ' ■ : . 
! <2i2> dna ; ;; ; '!. ; 

<213> Artificial • Sequence 

<220> j 
<223> synthetic . construct 

<400> 240 ■■ ''-I', 
cctttcgcac : atcgatttgg 

<210> 241 

<211> 20 j 
<212> DNA \ 
<213> Artificial Sequence 

<220> j 
<223> synthetic construct 

<4oo> 241. ; . 
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20 



20 



20 



20 
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aaatcgatgt gccgaagaag 

<210> 242 • . I 
; <2;11> 20 ; 
' <212> DNA 

<213> Artificial Sequence 

: <220> 

<223> synthetic construct 

<400> 242 
. tcttcggcac atcgatttgg ; 

\ i • ■ ■ 

<2l0> 243 ■ ; 

: <211> 2 0 ; 
<212> DNA 

<213> Artificial Sequence: 
<220> 

<2 23> synthetic construct 1 

<400> 243 J 
aaatcgatgt ggttgcagag 

<210> 244 : 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> :■ 

^2 23 > synthetic construct 

<400> 244 , ! ! 

ctgcaaccac atcgatttgg' 

<210> 245 \ 
<211> 20 

<212> DNA j 
<213> Artificial Sequence 1 

<220> ' I 

<223> synthetic construct 

: : ' i ' " : 

<4!00> 245 | !; j \ 

aaatcgatgt ggatggtgag 

<210> 246 ■' j ; 

<211> 20 ; ; ;! 

<212> DNA ■ \ 

<213> Artificial Sequence 

; 1 i 
<220> • i 

i ■ : ; 

<223> synthetic construct 
<400> 246 ' 

caccatccac atcgatttgg 

<2l0> 247 ; . 

<211> 20 ; : 

<212> DNA : : j , [ 

<2;13> Artificial Sequence 



PCT/US2004/042964 
20 



20 



2 0. 



2 0 



2 0 ( 
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<220> :/:.!. : 
<22 3> synthetic construct 

<400> 247 j ; 

aaatcgatgt ;g<btatcgcag 20 
<210> 248 

<211> 20 ; ! i ■ 

<212> DNA ; 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 248 

gcgatagcac atcgatttgg 20 

<210> 249 
<211> -20 : : ] ! 
<212> DNA 

<213> Artificial Sequence 

<220> ! ; 

<223> synthetic construct 

<400> 249 

aaatcgatgt gcgaaagcag ,20 

<210> 250 
<211> 20 

<212> DNA : ! . ; 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 
<400> 250 

gctttcgcac atcgatttgg 20 

<210> 251 : ; ' ! 

<211> 20 M ; 

<212> DNA : ^ 

<213> Artificial Sequence 

<22p> ■ •; ; ! \ 
<223> synthetic construct 

<400> 251 

aaatcgatgt ^gacactgga^ 20 
<210> 252 

<211> 20 ; 
<212> DNA ' 

<213> Artificial Sequence 
<220> 

<223> synthetic 'construct 

<400> 252- , 

ccagtgtcac latcgatttgg 20 
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<210> 253 
<211> 20 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 253 

aaatcgatgt gtctggcaag 



20 



<210> 254 

<211> 20 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 254 

tgccagacac atcgatttgg 20 

<210> : 255 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 255 



<210> 256 
<211> 20 
< 2 1 2 > DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400>: 256 ' ' 
gaccatccac atcgatttgg 20[ 

<210> 257 
<211>;20 
<212> DNA 

<213> Artificial Sequence '[ 
<220> 

<223> synthetic construct 
<400> 257 

aaatcgatgt ggttgcacag 20 

<210>258 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



aaatcgatgt ggatggtcag 



20 
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<220> 

<223> synthetic construct 
<400> 258 

gtgcaaccac atcgatttgg 

<210> 259 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 259 . 
aaatcgatgt gggcatcgag 

<210> 260 
<211> 20 
<212> DNA 

<2 13 > Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 260 

cgatgcccca tccgatttgg 

<210> 261 , 
<211> 20 
<2 12 > DNA 

<213> Artificial Sequence 

<220> ! 
<223> synthetic construct 

<400> 261 

aaatcgatgt gtgcctccag 

<210> 262 ■ 

<211> 20 . 

<212> DNA I . 

<213> Artificial Sequence . 

<220> 

<22 3> synthetic ,construct 
<400> 262 

ggaggcacac atcgatttgg 

<210> 263 
<211> 20, 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic ^construct 

<400> 263 ; 
aaatcgatgt gtgcctcaag ; 
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■ ;20 



20 



20 



20 



20 
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<210> 264 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

I 

<220> 

<2 23> synthetic construct 

<400> 264 ; 
tgaggc'acac atcgatttgg 

<210> 265 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<22o> ! ; 

<223> synthetic 'construct 

<400> ;265 

aaatcgatgt gggcatccag 

<210> 266 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 266 

ggatgcccac atcgatttgg 

<210> 267 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 267 ; ' 

aaatcgatgt gggcatcaag 

<210> 268 

<211> -20 ; 

<212> DNA • ' • 

<213> Artificial isequence 

<220> - \ ;. 

<223> isynthetic construct 

<400> ^268 

tgatgcccac ;atcgatttgg 

<210> |269 

<211> : 20 - 

<212> DNA 1 

<213> Artificial Sequence 



PCT/US2004/042964 



20 



20 



20 



20 



i 20 



49/162 



WO 2005/058479 

<220> 

<2 2 3> synthetic' construct 
<400> 269 

aaatcgatgt gcctgtcgag 

<210> 270 
<211> 20 
<212> DNA ; 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 270 

cgacaggcac atcgatttgg 

<210> 271 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 271 

aaatcgatgt ggacggatag 

<210> 272 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 272 

atccgtccac atcgatttgg 

<210> 273 

<211> 20 ; : 

<212> DNA 

<213> Artificial ; Sequence 

: <220> : ; 
<223> synthetic construct 

<400> 273 

aaatcgatgt gbctgtccag 

<210> 274 ; 
<211> 20 
<212,> DNA !. 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 274 

ggacaggcac atcgatttgg 
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: 20 



20 



20 



20 



; 20 
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<210> 275 ' . 

<2ii> 20 ; ! 

<212> DNA ; 

<213> Artificial; Sequence 



<220> 

<22 3> synthetic construct 



<400> 275 ; ; '[ 

aaatcgatgt gaagcacgag 



20 



<210> 276 ; : j 

<211> 20 
<212> DNA . 

<213> Artificial' Sequence 



<220> 

<2 23> synthetic construct 



<400> 276 

cgtgcttcac atcgatttgg 



20 



<210> 277 ' ; 

<211> 20 
<212> DNA 

<213> Artificial' Sequence 
<220> 

<2 23> synthetic construct 
<400> 277 

aaatcgatgt gcctgtcaag 2 0 

<210> 278 
<211> 20 
<212> DNA 

<213> Artificial; Sequence 



<220> 

<223> synthetic construct 



<400> 278 ; . 
tgacaggcac atcgatttgg 



20 



<210> 279 
<211> 20 
<212> DNA 



<213> Artificial; Sequence 



<220> i : 

<223> synthetic construct 



<400> 279 

aaatcgatgt gaagcaccag 



20 



<210> 280 

<211> 20 [ .' 

<212> DNA : " ' . ' • ' 

<213> Artificial: Sequence 



■ : ■! f 
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<220> 

<223> synthetic construct 
<400> 280^ 

ggtgcttcab atcgatttgg 

I 

<210> 281; I 
<211> 20 ; : . 
<212> DNA : 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 
<400> 281 

aiaatcgatgt gccttcgtag 

<210> 282! ! 
<2il> 20 I 
<212> DNA : : 

<213> Art-ificial Sequence 
<220> 

<223> synthetic construct 
. <400> 282 

acgaaggcac atcgatttgg 

; 

<210> 2 83; 
<211> 20 : 
<212> DNA ; 

<213> Artificial Sequence 
<220> . i ■■ 

<223> synthetic construct 
<400> 283! 

aaatcgatgjt gtcgtccgag 

I 

<210> 2 84| 
■ <211> 20 j , 
<212> DNA : 

<213> Artificial Sequence 

■ ■ ■ 'j* i ' . 

- <220> ;..«:-.■ I ' 

<223> synthetic construct 

j ' 

<400> 284= : . 

I ; ■ 

cggacgacac .atcgatttgg 

<210> 285 i 
<211> 20 j 
<212> DNA ; 

<213> Artificial Sequence 
<220> .j 

. <223> synthetic construct 

' : i ■ 

<j400> 285| : 
aaatcgatgft ggagtctgag 

f . 



20 



20 



20 



2 0 



20 

i ■ . 
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<210> 286 i 
<211> 20 
<212> DNA , 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 286 j 

cagactccac- atcgatttgg 



20 



<210> 287 \ 
<211> 20 
<212> DNA ■ 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 288 i 
<211> 20 : 
<212> DNA : 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 
<400> 288 J 

cggatcacac; atcgatttgg 20 

<210> 289 \ 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 289 | .; 

aaatcgatgt, gtcaggcgag 20 

<210> 290 
<211> 20 • 
<212> DNA ! 

<213> Artificial Sequence , 
<220> 

<223> synthetic construct 

: i 

<400> 290 i 



<400> 287 ! 

aaatcgatgt; gtgatccgag 



20 



cgcctgacaC; atcgatttgg 



20 



<210> 291 'J 
<211> 2 0 • 
<212> DNA • ! 



<213> Artificial Sequence 



<220> 
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<223> synthetic construbt 



<400> 291 

aaatcgatgt gtcgtccaag 



20 



<210> 292 

<211> 20 : 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 292 

tggacgacac atcgatttgg 



20 



<210> 293 



<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 293 

aaatcgatgt ggacggagag 2 0 

<210> 294 

<211> 20 

<212> DNA . . 

<213> Artificial Sequence 



<210> 295 
<211> 20 
<212> DNA; ' 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 295 

aaatcgatgt ggtagcagag 20 

<210> 296 
<211> 20 
<212> DNA 



<220> 

<22 3> synthetic construct 



<400> 294 

ctccgtccac atcgatttgg 



20 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 296 

ctgctaccac atcgatttgg 



20 



<210> 297 
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<211> : 20; 
<212> DNA ; 

<213> . Artificial Sequence 



<220>. 

<223> , synthetic construct 



<400> 297 

aaatcgatgt ggctgtgtag 



20 



<210>. 298 
<211> 2 0 ! 
<212>;DNA . 

<213> Artificial Sequence 
<220> , 

<223> synthetic construct 
<400> 298 : 

acacagccac, atcgatttgg 20 

<210> 299 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 299 

aaatcgatgt ggacggacag 20 

<210> 300 : 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 300 

gtccgtccac atcgatttgg 



20 



<210> 301 ; 
<211> 20 
<212>;DNA , 

<213> Artificial Sequence 



<220> ; 

<223> : synthetic construct 



<400> 301 

aaatcgatgt gtcaggcaag 



20 



<210> 302 ; 
<211> 20 I 
<212>, DNA ! r 

<213> Artificial Sequence 



<220>: 
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<223> synthetic construct 1 

<400> 302 ; • 
tgcctgacac atcgatttgg 

<210> 303 
<211> 20 

<212> DNA ; 
<213> Artificial Sequence 

<220> • ; 

<22 3> synthetic construct 
<400> 303 

aaatcgatgt ggctcgaaag 

<210> 304 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 304 

ttcgagccac atcgatttgg 

<210> 305 
<211> 20 
<212> DNA 

<213> Artificial' Sequence 
<220> 

<223> synthetic construct 
<400> 305 

aaatcgatgt gccttcggag j 

<210> 306 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> > \ 

<223> synthetic construct 

<400> 306 : 
ccgaaggcac atcgatttgg 

<210> 307 

<211> 20 i 

<212> DNA i i 

<213> Artificial Sequence 

l ; : " ■ 

<22 0> 

<;223> synthetic construct 

<400> 307 ! ' 
aaatcgatgt ggtagcacag ! r 

<210> 308 i 
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20 



20 



20 



20 



20 
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<211> 2 0 
•<212> DNA 

:<213> Artificial Sequence; 
<220> .. 

!<2 23> synthetic construct 
<400> 308 

gtgctaccac atcgatttgg 

309 , 
20 
DNA 

Artificial Sequence 
<220> 

<2 23> synthetic construct 
<400> 309 

aaatcgatgt ggaaggtcag 

<210> 310 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

. <223> synthetic construct 
<400> 310 

gaccttccac atcgatttgg 

;<21'0> 311 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> . 

! <22 3> synthetic construct 
1 <400> 31l' . 

| aaatcgatgt Iggtgctgtag ; 

<210> |312 
;<211> 2 0 

<212> DNA : 
,<213> Artificial Sequence 

<220> •' 1 

j<22 3> synthetic construct 

i<400> 312 

acagcaccac ; atcgatttgg 

:<210> 313 ' : 
<211> 9 ; s 

: <212> DNA 

<213> Artificial Sequence 
<220> ' 

;<22 3> .synthetic construct 



<210> 
;<2ii> 

<212> 
<213> 
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20 



20 



20 



20 
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<400> 313 '! 

! 

gttgcctgt 

' i 

<210> 314 : 
<211> 9 
<212> DNA ! 

<213> Artificial Sequence 

<220> : 

<2 23> syrithetic construct 

<400> 314 ; 
aggcaacct ' 

<210> 315 ; 
<211> ,9 \ ' '. 
<212> DNA } 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 315 : 
caggacggt 

<210> 316 
<2X1> 9 
<212> DNA : 

<213> Artificial Sequence 

<220> ' ! 

<223> syrithetic construct 

<400> 316 : 
cgtcctgct 

<210> 317 : 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> , -| 

<223> synthetic construct 

<400> .317 ! ; 
agacgtggt 

<210> 318 ' 
<211> 9. 

<212> DNA j , : 
<213> Artificial Sequence 

<220> ;.»'::''■' " \ 
<223> synthetic! construct 

<400> 318 
cacgtctct 

<210> 319 
<211> 9 • ! • 



PCT/US2004/042964 



9 



9 



9 



9 



9 
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<212> DNA ! ; 

<213> Artificial Sequence 

<220> 

<22 3> synthetic construct 

<400> 319 
caggaccgt 

<210> 320 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> , 

<223> synthetic construct 

<400> 320 
ggtcctgct . 

<210> 321 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 321 
caggacagt 

<210> 322 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 322 
tgtcctgct 

<210> 323 : 
<211> 9 : ' 
<212> DNA 

<213> Artificial Sequence 
<220> 

<c22 3> synthetic construct 

<400> 323 
cactctggt 

<210> 324 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



PCT/US2004/042964 



9 



9 



9 



9 
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<400> 324 
cagagtgct 



9 



<210> 325 

<211> 9 

<212> DNA : 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 
,<40,0> 325 . '■ . 

gacggctgt 9 

<210> 326 

<211> 9 

<212> DNA ; i 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 
<4 00> 32 6 

agccgtcct 9 

<210> 327 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
;<220> 

;<223> synthetic construct 
<400> 327 

cactctcgt 9 

<210> 328 
<211> 9 
<212> DNA : 

;<213> Artificial Sequence 



<220> ; 

;<223> synthetic construct 



<400> 328 
gagagtgct 



9 



<210> 329 

:<211> 9 : ; 

<212> DNA 

•<213> Artificial Sequence 



; <220> 

<223> synthetic : construct 



<400> 32 9 
•gtagcctgt 



9 



<210> 330 
<211> 9 
;<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> synthetic construct 

■ 

<400> ; 330 
aggctacct '. 

<210> ,331 
<211> ; 5 * 
<212> DNA 

<213> Artificial Sequence 
<220> : 

<223> isynthetic construct 

<400> 331 
gccacttgt 

<210> 332 
<211> 9 
' <212> DNA 
<213> Artificial Sequence 

<220>, ; 

<223> synthetic construct 

<400> 332 
aagtggcct : 

<210> |333 ; 
<211> .|9 
<212> ;DNA 

<213> Artificial Sequence 
<220> ; 

<223> jsynthetic construct 

<400> ;333 
catcgqtgt , 

<210> |334 . 
<211> *9 
<212> DNA : 

<213> Art i if icial /Sequence 

i ' * 

; <220> ; ' 
<223 > ■ ^synthetic construct 

<400> 334 
' agcgatgct 

<2io>:;335 

t <211> 9 
■ <212> DNA 
<213> Artificial Sequence 

" i . 

<220> 

<223> synthetic construct 
: <400>. 335 
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9 



9 



9 



• 9 



61/162 



WO 2005/058479 

cactggtgt 

<210> 3 ! 36 ; 
<211> 9; [" 
< 2 1 2 > DNA ' 

<213> Artificial; Sequence 

<220> ■ 

<22 3> synthetic construct 

<400> 336 

accagtgct : \ 
<210> 337 

<211> 9 : • 

<212> DNA !; 

<213>; Artificial 1 Sequence 

<220> 

<22Z> synthetic construct 

<400> 337 
gccactggt 

<210> 338 
<211> 9 
<212> : DNA 

<2 13 > ; Artificial Sequence 
<220>. 

<223> synthetic construct 

<4 00> 33 8 ' ■ 

cagtggcct 

<210> 339 
<211> 9 

<212 > ' DNA , 
<213> . Artificial Sequence 

<220>, • i 

<223>„ synthetic construct 

<400> ' 339 ■ ■■ j: : ; 
tctggctgt ;j j ; 

1 -j : . ■ 

<210> : 340 
<211> 9 i 

<212>dna : : 

<213> , Artificial; Sequence 

<220> . ; i 

<2 23> synthetic construct 

<400> 340 
agccagact 

<2\0> 341 : : 

<211> 9 

<212> DNA : ; . ! : 

<213> Artificial! Sequence 



PCT/US2004/042964 
9 



9 



9 



9 



9 
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; j 

<22o> :!'.;;■ 

<22 3> synthetic construct 

<400> 341 ; 
gccactcgt 

<210> 342 
<211> 9 [ i 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 342 
gagtggcct 

j <210> 343 

: <2ii> 9 

<212> DNA 

<213> Artificial Sequence 

<220> , ; [ 

<223> synthetic construct 

<400> 343 
tgcctctgt 

<210> 344 
<211> 9 
<212> DNA '. 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 344 
a ga99cact ; 

<210> 345 ! | 

<2n> 9 ; 

<212> DNA ' : 
j. <213> Artificial Sequence 

:• <220> ' 1 : 

<223> synthetic construct 

<400> 345 ! 
catcgcagt i 

<210> 346 
<211> 9 \ \ 

<212> DNA 

<213> Artificial Sequence 

<220> ' 

<223> synthetic construct 

<400> 346 
tgcgatgct 



PCT/US2004/042964 



9 



9 



9 



9 



9 
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<210> 347 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic : construct 

<400> 347 
caggaaggt 

<210> 348 

<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 348 
cttcctgct 

<210> 349 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 349 
ggcatctgt 

<210> 350 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<220> ' . 

<223> synthetic construct 

<400> 350 
' agatgccct 

<210> 351 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic! construct 

<400> 351 
c 99tgctgt 

<210> 352 
<211> 9 
<212> DNA 

<213> Artificial .Sequence 
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9 



9 



9 



9 
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<220> ! 

<223> synthetic iconstruct 



<400> 352 i 
agcaccgct . , 



9 



<210> 353 

<211> 9 ' ' 

<212> DNA : i 

<213> Artificial Sequence 

<220> 

<22 3> synthetic iconstruct 
<400> 353 

cactggcgt 9 
<210> 354 

<2ii> 9 : j 

<212> DNA 

<213> Artificial Sequence 



<220> . 

<223> synthetic construct 

<400> 355 
tctcctcgt 

<210> 35;6 ' 

<211> 9 • I • 

<212> DNA . : 

<213> Artificial Sequence 

. [ ; i 

<220> ; ; 



<220> 

<223> synthetic 'construct 



<400> 354 
gccagtgct 



9 



<210> 355 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<223> synthetic iconstruct 



<400> 356 
gaggagact 



9 



<210> 357 ' 
<211> 9 
<212> DNA ■ j 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 357 
cctgtctgt 



9 
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<210> 358 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> synthetic construct 

<400> 358 
agacaggct 

<210> 359 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 359 
caacgctgt 

<210> 360 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 360 
agcgttgct 

<210> 361 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 361 
tgcctcggt 

<210> 362 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 362 
cgaggcact 

<210> 363 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



66/162 
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<220> 

<2 23> synthetic construct 

<400> 363 
acactgcgt ; . 

<210> 364 

<211> 9 ; 

<212> DNA i 

<213> Artificial 1 Sequence 

<220> 

<223> synthetic construct 

<400> 364 
gcagtgtct 

<210> 365 

<211> 9 ; 

<212> DNA i . 

<213> Artificial Sequence 

<220> 

<22 3> synthetic construct 

<4 00> 365 
tcgtcctgt 

<210> 366 
<211> 9 
<212> DNA 

<213> Artificial; Sequence 
<220> 

<223> synthetic construct 

<400> 366 
aggacgact 

<210> 367 
<211> 9 
<212> DNA { 

<213> Artificial; Secjuence 

■ i. 
<220> ■ f 

<223> synthetic construct 

<400> 367 
gctgccagt 

I i i 

<210> 368 
<211> 9 
<212> DNA 

<213> Artificial; Sequence 
<220> 

<223> synthetic construct 

<400> 368 ;' I 
tggcagcct '. 
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9 



9 



9 



9 



9 
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<210> 369 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 369 
tcaggctgt 



9 



<210> 370 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 370 

agcctgact 9 

<210> 371 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<210> 372 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 372 

acctggcct ; • ■$ 

<210> 373 • : 
<211> 9 
<212> DNA . 

<213> Artificial Sequence 



<220> 

: <223> synthetic construct 



<400> 371 
gccaggtgt 



9 



<220> 

<223> synthetic construct 



<400> 373 
c 99acctgt 



9 



<210> 374: 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> synthetic construct 

<400> 374 
aggtccgct 

<210> 375 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 375 
caacgcagt 

<210> 376 
<211> 9 
<2 12 > DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 376 
tgcgttgct 

<210> 377 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 377 
cacacgagt 

<210> 378 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 378 
tcgtgtgct 

<210> 379 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 379 
atggcctgt 

<210> 380 
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9 



9 



9 



9 



9 
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<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<2 23> synthetic construct 



<400> 380 
aggccatct 



9 



<210> 381 

<211> 9 ; • 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<4oo> 381 ; 

ccagtctgt 9 

; <210> 382 

<211> 9 

<212> DNA 
: <213> Artificial Sequence 

<220> 

<223> synthetic construct 
: <400> 382 ... 

agactggct 9 

<210> 383 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<2 23> synthetic construct 



<400> 383 
gccaggagt 



9 



<210> 384 • ! : 

<211> 9 J 

<212> DNA . j 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 384 
tcctggcct 



9 



<210> 385 ' 
<211> 9 
<212> DNA ; 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 
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<400> 385 
cggaccagt 



9 



<210> 386 ; 
<211> 9 

<212> DNA . . : ,' 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 386 
tggtccgct 



9 



<210> 387 
<211> 9 I 
<212> DNA : 

<213> Artificial Sequence 
<220> 

<223> synthetic! construct 
<400> 387 

ccttcgcgt : 9 

<210> 388 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 388 : 
gcgaaggct 9 

<210> 389 : 
<211> 9 
<212> DNA . 

<213> Artificial Sequence I 



<220> 

<223> synthetic construct 



<400> 389 : 
gcagccagt 



9 



<210> 390 
<211> 9 

<212> DNA , - ; 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 390 

tggctgcct 



9: 



<210> 391 
<211> 9 



I- 
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<212> DNA : 

<213> Artxf icia!l Sequence 

<220> ;■ ; 

<22 3 > synthetic construct 

<400> 391 
ccagtcggt 

<210> 392 ; 
<211>. 9 

<212> DNA ;: ; 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 392 
cgactggct 

<210> 393 
<211> 9 
<212>DNA 

<2 13 > Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 393 
actgagcgt . 

<210> 394 
<211> 9 
<212> DNA : 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 394 
gctcagtct 

<210> 395 ; ■ : 
<211> 9 

<212> DNA ; ! 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 395 '! 
ccagtccgt 

<210> 396 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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9 



9 



9 



9 
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<400> 396 
ggact'ggct 

<210> 397 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400>; 397 
ccagtcagt 

<210>; 398 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 398 
tg.actggct 

<210> 399 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 399 
catcgaggt 

<210> 400 
<211> 9 ; 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 400 . ; 
ctcgatgct 

<210> 401 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 401 
ccatcgtgt ' .. 

<210> 402 
<211> 9 
<:212> DNA 



PCT/US2004/042964 

9 

9 
9 
9 
9 
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<213> Artificial Sequence 



<220> 

<22 3> [synthetic construct 



<400> '402 
acgatggct 



9 



<2io> :403 ; 

<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> jsynthetic construct 
<400> ;403 

gtgctgcgt . ! 9 

<210> 404 
<211> "9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 
<400> ;404 

gcagcacct 9 

<210> -405 

<211> '9 

<212> DNA ! 

<213> iArtificial Sequence 



<220> 

<223> synthetic construct 



<400> 405 
gactacggt 



9 



< 210> j4 06 
<211> |9 
<212> iDNA 

<213> Artificial Sequence 



<22o> : 

<223> isynthetic construct 



<400> 406 
cgtagtcct 



9 



<210> 1407 

<2ii> 9 : 

<212> DNA 

<213> Artificial Sequence 



<220> ; . .. 

<223> isynthetic construct. 



<400> ;407 
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gtgctgagt 

i 

<210> 408 
<211> 9 
<212> DNA , 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 408 
tcagcacct 

<210> 409 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> synthetic construct 

<400> 409 
gctgcatgt t 

<210> 410 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 410 
atgcagcct 



<210> 411 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 411 
gagtggtgt 

<210> 412 | 

<211>:9 

<212> DNA . 

<213> Artificial Sequence 
<220> ' . : 

<223> synthetic construct 

<400> 412 ".. 
accactcct 

<210> 413 
<211> 9 
<212> DNA '. 
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<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 413 
gactaccgt 

<210> 414. 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> . 

<223> synthetic construct 

<400> 414 
ggtagtcct 

<210> 415 

<211> 9 ; 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 415 
cggtgatgt 

<210> 416 i 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 416 
atcaccgct 

<210> 417 
<211> 9 

<212> DNA ; j 

<213> Artificial' Sequence 

<220> 

<223> synthetic -construct 
<400> 417 

tgcgactgt ; 
<210> 418 

<211> 9 : 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 418 ' 
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9 



9 



9 
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agtcgcact 

<210> 419 
<211> 9 ; 
<212> DNA 

<213> Artificial Sequence 
:<220> 

<223> synthetic construct 

<400> 419 i 
tctggaggt; 

<210> 420 1 
<211> 9 
<212> DNA; 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 420 
ctccagact 



<210> 421 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 421 ; 
agcactggt 

<210> 422 
<211> 9 . 
<212> DNA ! 

<213> Artificial Sequence 
<22 0> 

<223> synthetic construct 

<400> 422! I 
cagtgctct 

<210> 423 
<211> 9 
<212> DNA ; 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 423 
tcgcttggt; 

<210> 424" 
<211> 9; 
<212> DNA ! 
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<213> ' Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 424 
caagcgact 

<210> 425 
<211> . 9 
<212>:DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 425 
agcactcgt 

<210> 426 
<211> 9 
<2 12 > DNA 

<213> . Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 426 
gagtgctct 

<210> 427 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400>;427 
gcgattggt 

<210> 428 

<211>;9 

<212>;DNA 

<213> i Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 428 
caatcgcct 

<210>:429 
<211>; 9 
<2 12 > • DNA 

<2 13 > Artificial Sequence 

<220> j 

<223> synthetic, construct 

<400>429 
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ccatcgogt 



9 



<210> 430 
<211> 9 
<212> DNA ; 

<213> Artificial, Sequence 
<220> 



<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 431 

tcgcttcgt 9 

<210> 432 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 432 

gaagcgact 9 

<210> 433 
<211> 9 
<212> DNA 



<22 3> synthetic construct 



<400> 430 
gcgatggct 



9 



<210> 431 



<213> Artificial Sequence 



<220> ; 

<22 3> synthetic construct 



<400> 433 
agtgcctgt 



9 



<210> 434 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 434 
aggcactct 



9 . 



<210> 435 
<211> 9 
<212> DNA 

<213> Artificial Secjuence 
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;<220> ; 

. <223> synthetic construct 

i ' 

I <400> 435 ; 
; ggcataggt ; 

' <210> 436 : 
<211> 9 
<212> DNA , 

<213> Artificial Sequence 
<220> 

,<223> synthetic construct 

; <400> 436 
; ctatgccct 

;<210> 437 ' 
<211> 9 
<212> DNA ; 

<213> Artificial Sequence 
:<220> 

,<223> synthetic construct 

i<400> 437 : 
gcgattcgt 

<210> 438 
<211> 9 
<212> DNA : 

l<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 438 ; 
igaatcgcct 

<210> 439 j 
;<2ri> 9 ; 
|<212> ;DNA | 

<213> Artificial Sequence 
<220> I 

<223> synthetic construct 

<400> 439 
i tgcgacggt 

i I 
j<2l6> 440 ; 

<211> 9 S ■ • ■ 
;<212> DNA j 

;<213> Artificial Sequence 

<22o> ; 

*<223> synthetic construct 



'l <400> 440 ; 



; cgtcgcact 
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<210> 441 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 441 
gagtggcgt 

<210> 442 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 442 
gccactcct 

<210> 443 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 443 
cggtgaggt 

<210> 444 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 444 
ctcaccgct 

<210> 445 
<211> 9 
<212> DNA ■ 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 445 
gctgcaagt 

<210> 446 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct 



<400> 446 
ttgcagcct 



9 



<210> 447 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 447 
ttccgctgt 



9 



<210> 448 
<211> 9 
<212> DNA; 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 448 

agcggaact 9 

<210> 449 

<211> 9 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 449 

gagtggagt 9 

<210> 450: 
:<2ll> ;9 
<212> • DNA 

<213> Artificial Sequence 



,<220> 

<223> synthetic construct 



<400> 450 
tccactcct 



9 



<210> 451 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 451 



i 

82/162 
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acagagcgt : 

<210> 452 ; 

<211> 9 : '' , ; 

<212> DNA . 

<213> Artificial ^.Sequence 

. <220> , ■ 

<223> synthetic construct 

<400> 452 
gctctgtct 

<210> 453 
<211> 9 
<212> DNA ■ 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 453 
; tgcgaccgt ; ' 

<210> 454 
<211> 9 

<212> DNA • ! ■ 

<213> Artificial Sequence 

<220> 

<22 3> synthetic construct 

: <400> 454 : 
ggtcgcact 

<210> 455 
<211> 9 
<212> DNA ' 

<213> Artificial Sequence 

! i 
<220> ' •'. 

<223> synthetic construct 

<400> 455 ; ' ■ ■ 
cctgtaggt ; 

<210> 456 « ■ 

<211> 9 ''•]•[".'• 
<212> DNA ; 
: <213> Artificial Sequence 

<220> | : 

<223> synthetic construct 

<400> 456 : , " 
ctacaggct ■■• 

• 1 

<210> 457 , 

<211> 9 

<212> DNA • 
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<213> Artificial Sequence 

<220> . : 

<223> synthetic construct 

<400> 457 ■ 
tagccgtgt j 

<210> 458 

<211> 9 

<212> DNA ; 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 458 
acggctact 

<210> 459 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 459 
tgcgacagt 

<210> 46.0 
<211> 9 
<212> DNA 

<213> Artificial Sequence: 
<220> 

<223> synthetic construct 

<400> 460 
tgtcgcact 

<210> 461 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct' 

<400> 461 
ggtctgtgt 

<210> 462 ; 

<211> 9 

<212> DNA : : 

<213> Artificial Sequence 

<220> ; i 

<223> synthetic construct 

<400> 462 
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acagaccct 



9 



<210> 463 
<211> 9 
<212> DNA ' ; 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 463 

cggtgaagt' 9 

<210> 464 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<210> 465 
<211> 9 
<212> DNA ! 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 
<400> 465 

caacgaggt 9 

<210> 466 
<211> 9 
<212> DNA | 

<213> Artificial Sequence 



<400> 466 I ; 

ctcgttgct 1 ! 9 

<210> 467 ' 
<211> 9 i | 
<212> DNA ' 

<213> Artificial Sequence 
<220> j 

<2 23> synthetic construct 
<400> ,467 

gcagcatgt 9 

<210> 468 ' 
<211> «9 
<212> DNA ! 



<220> ; 

<223> synthetic : construct 



<400> 464 
ttcaccgct 



9 



<220> : 

<223> synthetic construct 
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<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 468 
atgctgcct 



9 



<210> 469 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 469 
tcgtcaggt 



9 



<210> 470 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 470 

ctgacgact 9 

<210> 471 
<211> 9 
<2 12 > DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<210> 472 
<211> 9 
<212> DNA i. 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 472 I ■ 

tggcactct 9 

<210> 473 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct, 
<400> 473 



<400> 471 
agtgccagt 



9 
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tagaggcgt 

<210> 474- 
<211> 9 I 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 474 
gcctctact 

<210> 475 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

;<223> synthetic construct 

<400> 475 
gtcagcggt 

<210> 476 ! 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 476 
cgctgacct 

:<210> 477 
<211> 9 
<212> DNA 

:<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> All 
tcaggaggt 

<210> 478 

<211> 9 

;<212> DNA 

<213> Artificial Sequence 

;<220> 

'<223> synthetic construct 

<400> 478 
ctcctgact 

<210> 479 
<211> 9 
i<212> DNA 

<213> Artificial Sequence 
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<220> 

<22 3> synthetic construct 



<400> 479 
agcaggtgt 



9 



<210> 



480 



<211> 9 
. <212> DNA 
<213> Artificial Sequence 

<220> 

<223> synthetic construct 
<400> 480 

acctgctct 9 

<210> 481 i 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 481 

ttccgcagt 9 

<210> 4 82 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 



<400> 482 
tgcggaact 



9 



<210> 483 
<211> 9 
<212> ;DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> :483 
gtcagccgt 



9 



<210> 484 ; 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 4 : 84 
ggctgacct 



9 
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<210> 485 



<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 485 

ggtctgcgjt 9 

<210> 486 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<210> 487 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<22 3> synthetic construct 
<400> 487 

tagccgagt 9 

<210> 488 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 486 
gcagaccct 



9 



<220> 

<223> synthetic construct 



<400> 488 
tcggctact 



9 



<210> 489 

<2ii> 9 , : 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 489 
gtcagcagt 



9 



<210> 490 

<211> 9 

<212> DNA .; . 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct 



;<400> 490 
itgctgacct 



9 



<210> 491 
;<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
:<400> 491 

ggtctgagt 9 

<210> 492 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 492 

tcagaccct 9 

<210> 493 : 
<211> 9 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 493 

cggacaggt 9 

:<210> 494 
<211> 9 
;;<212> DNA : 



;<400> 494 
ctgtccgct 

<210> 495 
<211> 9 
;<212> DNA 

; <213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
:<400> 495 



<213> Artificial Sequence 



<220> 

; <223> synthetic construct 



ttagccggt 



9 
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! i • . 

<210> 496 • . 

<2ii> 9 ■ [ s " 

<212> DNA } . 

<213> Artificial. Sequence 

<220> 

<223> synthetic construct 

<400> 496 
cggctaact 

<210> 4 97 , 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 497 
gagacgagt 

<210> 498 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 498 
tcgtctcct 

<210> 499 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<40 0> 4 99 
cgtaaccgt 

<210> 500 
<211> 9 ; 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic bonstruct 

<400> 500 
ggttacgct 

<210> 501 
<211> 9 
<212> DNA . 

<213> Artificial Sequence. 
<220> i 
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<22 3> synthetic construct 



<400> 501 

ttggcgtgt 9 

<210> 502 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 1 

<223> synthetic construct 
<400> 502 

acgccaact 9 

<210> 503 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<210> 504 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 504 

ctgccatct 9 

<210> 505 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 503 
atggcaggt 



9 



<220> 



<223> synthetic construct 



<400> 505 
cagctacga 



9 



<210> 506 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 506 
gtagctgac 



9 
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<210> 507 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<220> i 
<223> synthetic construct 

<400> 507 
ctcctgcga 

<210> 508 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 508 
gcaggagac 

<210> 509 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 509 
gctgcctga 

<210> 510 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 510 
aggcagcac 

<210> 511 
<211> 9 
I <212> DNA 
<213> Artificial Sequence 

<220> 

<223> synthetic construct 

; <400> 511 
caggaacga 

<210> 512 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> synthetic construct ; 



<400> 512 
gttcctgac 



9 



<210> 513 
<211> 9 
<212> DNA 

<213> Artificial Sequence . 
<220> 

<223> synthetic construct ; 
<400> 513 

cacacgcga 9 
<210> 514 

<211> 9 i 
<212> DNA 

<213> Artificial Sequence \ 



<220> 

<223> synthetic construct 



<400> 514 
gcgtgtgac 



9 



<210> 515 
<211> 9 
<212> DNA 

<213> Artificial Sequence 1 



<220> 

<2 23> synthetic construct 



<400> 515 
gcagcctga 



9 



<210> 516 
<211> 9 
<2l2> DNA 

<213> Artificial Sequence! 



<220> 

<223> synthetic construct . 



<400> 516 
aggctgcac 



9 



<210> 517 

<211> 9 ; 
<212> DNA 

<213> Artificial Seiquence » 



<220> 

<223> synthetic construct 



<400> 517 
ctgaacgga 



9 
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<210> 518 
<211> 9 
<212> DNA. 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 518 
cgttcagac 

<210> 519 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 519 
ctgaaccga 

<210> 520 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 520 
ggttcagac 

<210> 521 
<211> 9 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 521 
tctggacga 

<210> 522 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 522 ; 
gtccagaac 

<210> 523 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct : 



<400> 523 
tgcctacga 



9 



<210> 524 

<211> 9 ' 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 524 
gtaggcaac 



9 



<210> 525 

<211> 9 : 
<212> DNA 

=<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 525 

ggcatacga 9 

<210> 526 
<211> 9 
<212> DNA ' 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 526 

gtatgccac 9 
<210> 527 

<211> 9 ■ ■ J. 

<212> DNA . 



<213> Artificial Sequence ; 



<220> 

<223> synthetic construct : 



<400> 527 
cggtgacga 



9 



<210> 528 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



•i 



<400> 528 
gtcaccgac 



9 
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<210> 529 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 529 
caacgacga 



9 



<210> 530 
<211> 9 
<212> DNA • 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 
<400> 530 

gtcgttgac 9 

<210> 531 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 531 

ctcctctga 9 

<210> 532 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<2 23> synthetic construct 



<400> 532 
agaggagac 



9 



<210> 533 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 533 
tcaggacga 



9 



<210> 534 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct 

<400> 534 ' 
gtcctgaac 

:<210> 535 

<211> 9 \ 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

i 

<400> 535 
aaaggcgga 

;<210> 536 
<211> 9 
<212> DNA; 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 536 
cgcctttac 

<210> 537 : . 
<211> 9 
<212> DNA! 

<213> Artificial Sequence 

<220> ; 
<223> synthetic construct 

<400> 537 
ctcctcgga 

i<210> 538 . 
:<211> 9 

;<212> DNA | 
;<213> Artificial Sequence 

' i 

j<220> I _ , 

, ;<223> synthetic construct 

;<400> 538 
cgaggagac ; 

<210> 539 

<211> 9 ! 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400^ 539 
cagatgcga • 

; i i 
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9 



9 



9 



9 



9 
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<210> 540 

<211> 9 ■ : 

<212> DNA .. . . 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 540 
gcatctgac 

<210> 541 ; 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 541 : 
gcagcaaga 

<210> 542 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 542 
ttgctgcac 

<210> 543 ' 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 543 ; 
gtggagtga : 

<210> 544 .: ■ 5 : 
<211> 9 : . 

<212> DNA ! 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 544 : J 
actccacac . 

<210> 545 
<211> 9 I 

<2i2> dna : I : 

<213> Artificial Sequence 

<22o> : : 
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<22 3> synthetic construct 

<400> 545 
ccagtagga 

<210> 546 ; | 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 546 
ctactggac 

<210> 547 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 547 ! 
atggcacga 

<210> 548 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 548 
gtgccatac 

<210> 549 ; j 
<211> 9 L j 

<212> DNA . | 
<213> Artificial Sequence 

<220> , . ; 

<223> synthetic construct 

<400> 549 
ggactgtga 

<210> 550 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

| 

<220> " : 

<223> synthetic construct 

<400> 550 
acagtccac 

<210> 551 
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9 
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9 



i 
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<211> 9 , 
<212> DNA^ i 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<4oo> 551 ! 

ccgaactga; j 

<210> 552: 
<211> 9 • 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<4 00> 552. : 
agttcggac •> 

<210> 553 ; 
<211> 9 
<212> DNA i 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 553 
ctcctcaga 

<210> 554 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 554; 
tgaggagac . 

<210> 555; 

<2ii> 9 ' : 

<212> DNA 

<213> Artificial Sequence 
<220> = i 

<223> syrithetic construct 

<400> 555 
cactgctga ! 

<210> ,556 
<211> 9 
<212> DNA :' 

<213> Artificial Sequence 

<220> '■.!■■•' 

<2 23> ; synthetic construct 
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<400> 556 
agcagtgac 



9 



<210> 557 



<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 557 

agcaggcga 9 

<210> 558 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> synthetic construct 
<400> 558 

gcctgctac 9 

<210> 559 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<210> 560 
<211> 9 
<212> DNA 

<2 13 > 'Artificial Sequence 

<220> . ; 

<223> synthetic construct 

<400> 560 

tcctgctac .9 

<210> 561 
<211> 9 
<212>,DNA 



<220> 

<22 3> synthetic construct 



<400> 559 
agcaggaga 



9 



<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 561 
agagccaga 



9 



<210> 562 
<211> 9 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 562 
tggctctac 

<210> 563 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 563 
gtcgttgga 

<210> 564 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 564 
caacgacac .• . . 

<210> 565 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 565 
ccgaacgga 

<210> 566 

<2ii>. 9 • . " ; , 

<212> DNA' 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 566 
cgttcggac 

<210> 567 
<211> 9 • 
<212> DNA 

<213> Artificial Sequence 

<220> ; . ; 

<223> synthetic construct 
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j <400> 567 
; cactgcgga 



9 



<210> 568 
: <211> 9 - 

<212> DNA ; 
1 <213> Artificial Sequence 



<220> 

<223> synthetic construct 



: <400> 568 
cgcagtgac 



9 



.* <210> 569 



<211> 9 
<212> DNA • 

<213> Artificial Sequence 
1 <220> 

<223> synthetic construct 
j <400> 569 

gtggagcga 9 

<210> 570 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 570 

gctccacac 9 

<210> 571 
: <211> 9 

<212> DNA 
. <213> Artificial Sequence 



<220> 

i <223> synthetic construct 



• <400> 571 
gtggagaga 



9 



<210> 572 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400>. 572 
tctccacac 



9 



: <210> 573 
; <211> 9 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 573 
ggactgcga 

<210> 574 
<211> 9 

<212> DNA ! 
<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 574 
gcagtccac 

<210> 575 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 575 
ccgaaccga 

<210> 576 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 576 
ggttcggac 

<210> 577 
<211> 9 

<212> DNA ;. 
<213> Artificial Sequence' 

<220> 

<223> synthetic construct 

<400> 577 
cactgccga 

<210> 578 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 578 
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ggcagtgac 



9: 



<210> 579 ■ 
<211> 9 
<212> DNA ; 

<213> Artificial Sequence 



<220> ; 

<223> synthetic construct 



<400> 579 
cgaaacgga 



9 



<210> 580 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 580 

cgtttcgac 9 

<210> 581 ' 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<210> 582 
<211> 9 

<212> DNA . , . 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 
<400> 582 ; ; • 

tcagtccac . ■' ' 9 

<210> 5iB3 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 581 
ggactgaga 



9 



<220> 



<223> synthetic construct 



<400f 583 
ccgaacaga 



9 



<210> 584 
<211> 9 
<212> DNA , 

<213:> Artificial Sequence 
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<22 0> 

<223> synthetic construct 

<400> 584 
tgttcggac 

<210'>:'5;85 
<211> 9 
<212> DNA ■ 

<213> Artificial Sequence : 
<220> 

<22 3> synthetic construct 

<400> 585 
cgaaaccga 

<210> 586 
<211> 9 
<212> DNA 

<213> Artificial Sequence 1 
<220> ' ; 

<22 3> synthetic construct 

<400> 5!86 
ggtttcgac 

<210> 587 
<211> 9 
<212> DNA 

<213> Artificial Sequence ; 
<220> 

<223> synthetic construct 

<400> 5:87 
ctggcttga 

<210> 588 
<211> 9 _ 
<212> DNA , 

<213> Artificial Sequence ; 
<220> 

<223>. synthetic construct 

<400> 5:88 • 
aagccagac 

<210> 5 89 ; 
<211> 9^ 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 589 
cacacctga 
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<210> 590 
<211> 9 
<212> DNA 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 590 
aggtgtgac 



9 



<210> 591 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 591 

aacgacega 9 

<210> 592 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<210> 593 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 593 



<220> 

<223> synthetic construct 



<400> 592 
ggtcgttac 



9 



atccagcga 



9 



<213> Artificial Sequence 



<210> 594 
<211> 9 
<212> DNA 



<220> 

<22 3> synthetic construct 



<400> 594 
gctggatac 



9 



<210> 5 95 
<211> 9 

<212> DNA I 
<213> Artificial Sequence 
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<22o> ,; 

<223> synthetic ■ ; construct 



<400> 595 
tgcgaagga 



9 



<210> 596 
<211> 9 , 
<212> DNA 



<213> Artificial Sequence 



<220> :i 

<223> synthetic ^construct 



<400> 596 
cttcgcaac 



9 



<210> 597. 
!<211> 9 : j 
<212> DNA 

<213> Artificial Sequence 



<220> : " :i . 

<22 3> synthetic iconstruct 



<400> 597 
tgcgaacga 



9 



<210> 598 
<211> 9 * 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 598 

gttcgcaac 9 
<210> 599 



<211> 9 : 

<212> DNA • ; j 

<213> Artificial Sequence 




<400> 599 
ctggctgga 



<210> 600 

<211> 9 , 

<212> DNA ; 

<213> Artificial Sequence 



<220> ■ ' 

<223> synthetic Iconstruct 



<400> 600' 
cagccagac 



9 
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<210> 601 

<211> 9 i 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic ; construct 



<400> 601 
cacaccgga 



9 



<210> 602 • 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 602 
cggtgtgac 



9 



<210> 603 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic : construct 
<400> 603 

agtgcagga 9 
<210> 604 



<210> 605 
<211> 9' 
<212> DNA 

<213> Artificial Sequence 

<220> ' . 

<i223> synthetic ; construct 

<400> 605 

gaccgttga 9 

<210> 606 

<211> 9 . 

<212> DNA 

<213> Artificial Sequence 



<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> synthetic ■ construct 



<400> 604 
ctgcactac 



9 
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<220> 

<22 3> synthetic construct 

<400> 606 
aacggtcac 

<210> 607 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 607 
ggtgagtga 

<210> 608 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 608 
actcaccac 

<210> 609 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 609 
ccttcctga 

<210> 610 
<211> 9 
<212> DNA ■ 

<213> Artificial Sequence 

<220> : ; 

<223> synthetic construct 

<400> 610 
aggaaggac 

<210> 611 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 611 
ctggctaga 
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<210> 612 s : 

<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 612 
tagccagac 

<210> 613 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 613 
cacaccaga 

<210> 614 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 614 
tggtgtgac 

<210> 615 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 615 
agcggtaga 

<210> 616 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 616 
taccgctac 

<210> 617 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> : ; 
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<223> synthetic construct 



<400> 617. 
gtcagagga 



9 



<210> 618 : , 

<211> 9 
<212> DNA 

=<213> Artificial Sequence 



<220> 

<223> synthetic: construct 



<400> 618 
ctctgacac 



9 



<210> 619 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 619 

ttccgacga 9 

<210> 620 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 620 

gtcggaaac 9 

<210> 621 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic' construct 



<400> 621 
aggcgtaga 



9 



<210> 622 

<211> 9 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 622 
tacgcctac 



9 



<210> 623 
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<211> 9 
< 2 1 2 > DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 623 
ctcgactga 

<210> 624 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 624 
agtcgagac 

<210> 625 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 625 
tacgctgga 

<210> 626 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 626 
cagcgtaac 

<210> 627 

<211> 9 1 ' . ; 
<212> DNA ': 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 627 ; 
gttcggtga 

<210> 628 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> , 

<223> synthetic construct 
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<400> 628 
accgaapac 

<:210> 629. 
<211> 9 
<212> DNA. 

<213> Artificial Sequence 

<220> I 

<223> synthetic construct 

<400> 629 
gccagcaga 

<210> 630 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<220> , 

<22 3> synthetic construct 

<400> 630 
tgctggcac 

<210> 631 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> iS31 
gaccgtaga 

<210> 632 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<220> ' ;'i • 

<223> synthetic construct 

<400> '632 
tacggtcac 

<210> 633 
<211> 9 
<212> DNA, 

<213> Artificial Sequence 

<220> ! 
<223> synthetic construct 

<400> 633 . : 
gtgctctga : 

<210> 634;: 
<211> 9 
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<212> DNA ; j 
<213> Artificial Sequence 

<220> 

<223> synthetic construct 
<400> 634 

agagcacac 9 

<210> 635 , 
<211> 9 
<212> DNA : 

<213> Artificial -Sequence 
<220> 

<22 3> synthetic construct 
<400> 635 : 

ggtgagcga 9 

<210> 636 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 636 
gctcaccac 

<210> 637 
<211> 9 
<212> DNA : 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 637 ■' 
ggtgagaga ; 

<210> 638 ! 
<211> 9 ; 
<212> DNA : : 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 638 ' 
tctcaccac ! 

<210> 639 ; 
<211> 9 ; ! 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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<400> 639 1 
ccttccaga ■' '. 

<210> 640 
<211> 9 I 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 640 
tggaaggac 

<210> 641 
<211> 9 • 
<212> DNA 

<213> Artificial Sequence 

<220> i 

<22 3> synthetic construct 

<400> 641 . 
ctcctacga \ 

<210> 642 
<211> 9 
<212>DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 642 
gtaggagac 

<210> 643 

<211> 9 

<212> DNA : 

<213> Artificial Sequence 

<220> r ; 

<22 3> synthetic construct 

<400> 643 ' 
ctcgacgga 

<210> 644 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<220> ■ }■ ■ 

<22 3> synthetic construct 

<400> 644 
cgtcgagac 

<210> 645 
<211> 9 
<212> DNA 
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<213> 'Artificial Sequence 

<220> • 

<22 3> -synthetic construct 

<400> ;645 : 
gccgtttga : 

<210> 64 6 

<2ii> 9 ; 

<212> DNA; 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 646; 
aaacggcac 

<210> .647; ; 
<211> 9 
<212> DNA; 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 647 . 
gcggagtga 

<210> 648 
<211> ;9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 648 
actccgcac 

<2i;0> ;64 9; ; 
<211> =9 . ; ■ ; ' 

<212> DNA' i -i; 
<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> ;649; 
cgtgcttga 

<210> 650; 
<211> :9 
<212> DNA . 

<213> Artificial Sequence 

<22:0> 

<223> synthetic construct 
<400> 650 , 
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aagcacgac 

<210> 651 
<211> 9 
<212> DNA 

<213> Artificial: Sequence 
<220> 

<223> synthetic construct 

<400> 651 
ctcgaccga 

<210> 652 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 652 
ggtcgagac 

<210> 653 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 653 
agagcagga 

<210> 654 
<211> 9 
<212> DNA 

<213> Artificial^ Sequence 
<220> 

<223> synthetic Construct 

<400> .654 , 
ctgctctac 

<210> 655 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> !655 
gtgctcgga 

<210> 656 
<211> 9 
<212> DNA 

<213> Artificial' Sequence 



9 
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<220>- 

<223> synthetic construct 

<400> 656 ; 
cgagcacac : 

<210>. 657 ; 
<211> 9 
<212> DNA ' 

<213> Artificial Sequence 
<22 0> 

<223> synthetic construct 

<40 0> 657 ; 
ctcgacaga 

<210> 658 \ 
<211> 9 
<212> DNA : 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 658 ; 
tgtcgagac 

<210> 659 : 
<211> 9 
<212> DNA i 

<21;3> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 659 : 
ggagagtga ; 

i 

<210> 660 I 
<211> 9 ; 
<212> dna; 
<213> Artificial Sequence 

<220> i 

<223> synthetic construct, 

<400>. 660 ! 
actctccac ; 

* I 

<210> 661 ; 
<211> 9 j ... 
<212> DNA | . 

<213> Artificial Sequence 
<22 0> 

<223> synthetic construct 

<400>, 661 j 
aggctgtga I . ;• . 
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<210> 662 
<211> 9 
<212> DNA . 

<213> Artificial. Sequence = 



<220> 

<22 3> synthetic construct 



<400> 662 
acagcctac 



9 



<210> 663 ■ 
<211> 9 
<212> DNA : 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 663 

agagcacga 9 

<210> 664 . 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 664 

gtgctctac 9 

<210> 665 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 665 
ccatcctga 



9 



<210> 666 .: 
<211> 9 

<212> DNA | ■ 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 666 
aggatggac 



9 



<210> 667 
<211> 9 
<212> DNA | 

<213> Artificial Sequence 
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<220> : 
<223> synthetic construct 

<400> 667 
gttcggaga 

<210> 668 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 668 
tccgaacac 

<210> 669 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 669 
tggtagcga 

<210> 670 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 670 
gctaccaac 

<210> 671 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 671 
gtgctccga 

<210> 672 
<211> 9 

<212> DNA ' ; 
<213> Artificial Sequence 

<220> ; 

<223> synthetic construct 

<400> 672 
ggagcacac 
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<210> 673 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 673 
gtgctcaga 

<210> 674 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 674 
tgagcacac 

<210> 675 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 675 
gccgttgga 

<210> 676 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> :676 
caacggcac 



<210> 677 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 677 
gagtgctga j 

<210> 678 
<211> 9 .- : 
<212> DNA 

<213> Artificial Sequence 
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9 



9 



9 



9 
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<220> 

<2 23> synthetic construct 



<400> 678 
agcactcac 



9 



<210> 679 : 

<211> 9 

< 2 1 2 > DNA ; 

<213> Artificial Sequence 



<220>. 

<223> synthetic; construct 



<400> 679 
gctccttga 



9 



<2io> 680 ;; 

<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic! construct 
<400> 680 

aaggagcac 9 : 

<210> 681 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 681 

ccgaaagga 9 

<210> 682 

<211> 9 ;. :/ J 

<212> DNA 

<213> Artificial Sequence 



<220> ; * 

<22 3> synthetic construct 



i 



<400> 682 
ctttcggac 



9 



<210> 683 . 
<211> 9 
<212> DNA ■ 

<213> Artificial Sequence 



<220> ' 

<22 3> synthetic construct 



<400> 683 
cactgagga 



9 
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<210> 684 
<211> 9 : 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 684 
ctcagtgac 

<210> 685 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 685 
cgtgctgga 

<210> 686 
<211> 9 
<212> DNA 

<213> Artificial Sequence: 
<220> 

<223> synthetic construct 

<400> 686 
cagcacgac 

<210> 687 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 687 
ccgaaacga 

<210> 68!8 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct, 

<400> 688 
gtttcggac 

<210> 689 
<211> 9 ; 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<22 3> synthetic construct 



<4 00> 689 
gcggagaga ; 



9 



<210> 690 
<211> 9 
<212> DNA 

<213> Artificial iSequence 



<220> 

<223> synthetic construct 



<400> 690 
tctccgcac 



9 



<210> 691 

<211> 9 

<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> synthetic construct 

<400> 691 

gccgttaga 9 

<210> 692 

<211> 9 . ■ 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 692 ; 

taacggcac 9 

<210> 693 ! 

<211> 9 

<212> DNA 

;<213> Artificial Sequence 



:<220> ■■ , 

<223> synthetic construct 



<400> 693 ; 
tctcgtgga 



9 



<210> 694' 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 694 
cacgagaac 



9 
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<210> 695 ! 
<211> 9 . 
<212> DNA 

<213> Artificial. Sequence 



<220> I . 

<223> synthetic construct 



<400> 695 
cgtgctaga 



9 



<210> 696 
<211> 9 . 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 696 
tagcacgac 



9 



<210> 697 
<2ll> 9 : 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 697 

gcctgtctt 9 

<210> 698 ' 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 698 
gacaggctc 



9 



<210> 699 
<211> 9 
<212> DNA ' 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 699 
ctcctggtt 



9 



<210> 700 
<211> 9 
<212> DNA 

<213> Artificial; Sequence 



<220> 
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<22 3> synthetic construct 

<400> 700 
ccaggagtc 

<210> 701 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 701 
actctgct-t 

<210> 702 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 702 
gcagagttc 

<210> 703 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 703 
catcgcctt 

<210> 704 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> i 

<223> synthetic construct 

<400> 704 
ggcgatgtc 

<210> 705 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 705 

gccactatt 
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9 



9 



9 



9 



9 
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<210> 706 

<211> 9 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 706 
tagtggctc 



9 



<210> 707 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 707 

cacacggtt 9 

<210> 708 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct .' 
<400> 708 

ccgtgtgtc 9 

<210> 709 

<211> 9 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 709 
caacgcctt 



9 



<210> 710 

<211> 9 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 710 
ggcgttgtc 



9 . 



<210> 711 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic construct 

<400> 711 
actgaggtt 

<2l0> 712 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 712 
cctcagttc 

<210> 713 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 713 
gtgctggtt 

<210> 714 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 714 
ccagcactc 

<210> 715 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 715 
catcgactt 

<210> 716 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 716 . .' 
gtcgatgtc 
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9 



; 9 



9 



9 



9 
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<210> 717 

<211> 9 

<c2\2> DNA 

<213> Artificial Sequence 
<220> 

<22;3> synthetic construct 

<400> 717 
ccatcggtt 

<210> 718 
<211> 9 
<212> DNA 

;<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 718 
ccgatggtc 

<210> 719 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 719 
gctgcactt 

<210> 720 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 720 
gtgcagctc 

<210> 721 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 

<400> 721 
acagaggtt 

<210> 722 
<211> 9 ; ; 

<212> DNA 

<213> Artificial Sequence 
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9 



9 



9 



9 
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<220> 

<223> synthetic construct 



<400> 722 
cctctgttc 



9 



<210> 723 
<211> 9 , 
<212> DNA ; : 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 723 
agtgccgtt 



9 



<210> 724 ! 

<211> 9 ! ' 

<212> DNA 

<213> Artificial Sequence 



<220> 

<2 23> synthetic construct 



<400> 724 
cggcacttc 



9 



<210> 725 :'. 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 725 

cggacattt ; 9 
<210> 726 

<211> 9 i : 

<212> DNA ; 
<213> Artificial Sequence ■ 



<400> 726 
atgtccgtc 

<210> 727 \ 
<211> 9 
<212> DNA . 

<213> Artificial Sequence 

<220> : i ! 

<223> synthetic construct 

<400> 727 



<220> . 
<223> synthetic construct 



ggtctggtt 



9 
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<210>. 728 
<211> 9 . 
<212> DNA ! 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 728 
ccagacctc 



9 



<210> 729 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 729 : 

gagacggtt ; 9 

<210> 730 ... 



<210> 731 ■ 
<211> 9 
<212> DNA , 

<213> Artificial Sequence 
<220> ; 

<223> synthetic construct 

<400> 731 ! 
ctttccgtt ' 

<210> 732 : 
<211> 9 
<212> DNA | 

<213> Arti-ficial Sequence 
<220> 

<22 3> synthetic construct 



<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 730 
ccgtctctc : 



9 



<400> 732 
cggaaagtc 



9 



<210> 733 ; 

<211> 9 1 ; 

<212> DNA ; 

<213> Artificial Sequence 



133/162 



WO 2005/058479 
<220> 

<223> synthetic construct 

<400> 733 : 
cagatggtt 

<210> 734 ' 
<211> 9 
<212> DNA 1 

<213> Artificial Sequence 
<220> 

<223> synthetic -construct 

<400> 734 
ccatctgtc 

<210> 735 

<211> 9 '■:'■} 

<212> DNA . 

<213> Artificial 'Sequence 
<220> 

<223> synthetic construct 

<400> 735 
cggacactt 

<210> 736 : 
<211> 9 
<212> DNA : 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 736 
gtgtccgtc 

<210> 737 
<211> 9 

<212> DNA ;•• ; 
<213> Artificial Sequence 

<220> . : 

) , • ; i 

<223> synthetic construct 

<400> 737 : 
actctcgtt 

<210> 738 

<211> 9 - !; 

<212> DNA ; 

<213> Artificial Sequence 

<220> j 

<223> synthetic icons truct 

<400> 738 : 
cgagagttc; ■ 
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9 . 



9 



9 



9 



9 
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<210> 739 ! 

;<211> 9 

\<212> DNA ; : : 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 

: <400> 739 i 
gcagcactt 

: <210> 740 ■ 

<211> 9 

<212> DNA 

: <213> Artificial. Sequence 

<220> 

<223> synthetic construct 

<4Q0> 740 
gtgctgctc 

, <210> 741 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 741 
i actctcctt 

<210> 742 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 742 : 
ggagagttc 

<210> 743 
<211> 9 , 
: <212> DNA : 
<213> Artificial Sequence 

<220> . 

<223> synthetic construct 

<400> 743 
accttggtt 

<210> 744 ; : 

. <211> 9 : 

<212> DNA 

<213> Artif icial:. Sequence 
<220> ( 
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9 



9 



9 



9 
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<22 3> synthetic icons t rue t 

<400> 744 
ccaaggttc 

<210> 745 

<211> 9 

<212> DNA : 

<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 745 
agagcegtt 

<210> 746 
<211> 9 
<212> DNA 

<213>' Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 746 
cggctcttc 

<210> 747 . 
<211> 9 
<212> DNA 

<213> Artific-ial Sequence 
<220> 

<223> synthetac construct 

<400> 747 
accttgett 

<210> 748 

<211> 9 

<212> DNA ! 

<213> Artificial Sequence 

<220> ■ ■) 

<223> synthet;ic construct 

<400> 748 
gcaaggttc 

■ ■ . 'i 
<210> 749 
<211> : 9 
<212> DNA ; 
<213> Artificial Sequence 

<22o> ; 

<223> synthetic construct 

<400> 749 
aagtccgtt 

<210> 750 
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9 



9 



9 



9 



9 i 
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<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 750 
cggactttc 

<210> 751 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<22p> 

<223> synthetic construct 

<400> 751 
ggactggtt 

<210> 752 
<211> 9 
<212> DNA. 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 752 
ccagtcctc 

<210> 753 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 753 
gtcgttctt 

<210> 754 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 754 
gaacgactc 

<210> 755 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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9 



9 



9 



9 
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; <400> 755 
cagcatctt 



9 



<210> 756 . 
<211> 9 
<212> DNA ; 

<213> Artificial Sequence 



<220> • 

<223> synthetic construct 



<400> 756 : 
gatgctgtc . 



9 



<210> 757 ; 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<2 23> synthetic construct 



<400> 757 
ctatccgtt 



9 



<210> 758 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 758 

cggatagtc 9 

<210> 759 
<211> 9 
<212> DNA 

<213> Artificial Sequence ; 



<210> 760 

<211> 9 | 
<212> DNA ; 

<213> Artificial Sequence \ 
<220> ; 

<223> synthetic construct \ 
<400> 760 ! 

cgagtgttc ', 9 
<210> 761 

<211> 9 : ; 



<220> • . 

<223> synthetic construct • 



<400> 759 
acactcgtt 



9 
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<212> DNA ' 

<213> Artificial Sequence 
<220> . ' 

<223> synthetic construct 
<400> 761 . . 

atccaggtt 9 

<210> 762 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400>; 762 

cctggattc 9 

<210> ! 763 
<211> 9 
<212> DNA . 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 763 

gttcctgtt i 9 

<210> 764 f 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 764 

caggaactc 9 

<210> 765 ] - ' 
<211> 9 . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 765 1 

acactcctt / 9 

<210> ! 766 ] ' 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> . 

<223> synthetic construct 



139/162 
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<400> 766 
ggagtgttc 



9 



<210> 767 

<211> 9 

<212> DNA . 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 767 
gttcctctt 



9 



<210> 
<211> 
<212> 
<213> 



768 
DNA 

Artificial Sequence 



<22 0> 
<223> 



synthetic construct 



<400> 768 
gaggaactc 



9 



<210> 769 . 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 769 

ctggctctt 9 

<210> 770 : 
<211> 9 ; 
<212> DNA 



<213> Artificial Sequence 



<22 0> 

<223> synthetic construct 



<400> 770 
gagccagtc 



9 



<210> 771 
<211> 9 
<212> DNA : 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 771 
a;cggcattt 



9 



<210> 772 
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<211> 9 ;! 
<212> DNA ( ; 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 772 
atgccgttc : 

<210> 773 ; . 
<211> 9 ' :; 

<212> DNA ; " 
<213> Artificial Sequence 



<220> 

<2 23> synthetic construct 

<400> 773 
ggtgaggtt 

<210> 774 
<211> 9 
<212> DNA ; 
<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 

<400> 774 . 
cctcacctc ! ; 

<210> 775 : ! 
<211> 9 " 
<212> DNA : 

<213> Artificial ; Sequence 
<220> 

<223> synthetic construct 

<400> 775 '; ; 
ccttccgtt ;i 

<210> 776 , 

<211> 9 ) y;"'- ■ ! ■' 

<212> DNA i; !; 

<213> Artificial ; Sequence 

<220> , ! 

<223> synthetic construct 

<400> 776 • 
cggaaggtc :. \ ■ 

<210> 777 ! ! 
<211> 9 : 



<212> DNA ; ; 

<213> Artificial Sequence 



<220> 
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<223> synthetic construct : 



<400> 777 

tacgctctt 9 

<210> 778 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> ; 

<223> synthetic construct 
<400> 778 

gagcgtatc 9 

<210> 779 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 779 
acggcagtt 



9 



<210> 780 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 780 
ctgccgttc 



9 



<210> 781 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> | 

<223> synthetic construct 



<400> 781 
actgacgtt 



9 



<210> 782 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 782 
cgtcagttc 



9 
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<210> 783 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 783 
acggcactt 



9 



<210> 784 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 784 

gtgccgttc 9 

,<210> 785 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> . 

<223> synthetic construct 
<400> 785 

actgacctt 9 

<210> 786 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 786 

ggtcagttc- 9 

<210> 787 ! 
<211> 9 \ 
<212> DNA 

<213> Artificial Sequence 



<210> 788 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 



<220> 

<223> synthetic construct 



<400> 787 
tttgcggtt 



9 
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<2-23> synthetic construct 

<4:0G> 788 
ccgcaaatc 

<2'l0> 789 
<211> 9 
<2;12> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 789 
tggtaggtt 

<210> 790 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 790 
cctaccatc 

<210> 791 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 

<400> 791 
gttcggctt 

<210> 792 
<211> 9 
<2;12> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 792 
gccgaactc 

i 

<210> 793 

<211> 9 ; 
<2,12> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 793 
gccgttctt 

<210> 794 
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9 



9 



9 



9 



9 
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<211> 9 : 
<212>. DNA;. 

<213> Artificial Sequence 

<22o> : 

<223> synthetic construct 

<400> 794? 
gaacggctc ' 

<210> 79s! 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 795 . 
ggagaggtt 

<210> 796 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 796 
cctctcctc 

<210> 797 

<211> 9 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 797^ 
cactgactt! 

,<210> 798- 
<211> 9 : 
<212> DNA; : - 

<213> Artificial Sequence 
<220> I 

<223> synthetic construct 

<400> 798; 
gtcagtgtc 

<210> 799 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<22o> ; 

<223> synthetic construct 



PCT/US2004/042964 



9 



9 



9 



9 
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<400> 799 
cgtgctctt 



9 



<210> 800 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 800 , 

gagcacgtc .9 

<210> 801 ' 
<211> 9 ; 

<212> DNA ■ 
<213> Artificial Sequence 



<210> 802 

<211> 9 ■ 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 802 

gcggatttc 9 

<210> 803 

<211> 9 ' , , 

<212> DNA ; 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 801 
aatccgctt 



9 



<220> : I 

<223> synthetic construct 



<400> 803 
aggctggtt 



9 



<210> 804 

<211> 9 . | 

<212> DNA ! 

<213> Artificial Sequence 



<220> ! : 

<223> synthetic construct 



<400> 804 
ccagccttc 



9 



<210> 805 
<211> 9 
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<212> DNA 

! <213> Artificial Sequence 



;<220> 

<223> synthetic construct 3 



<400> 805 
gctagtgtt 



<210> 806 ; 
:<211> 9 
<212> DNA " 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 806 

cactagctc 9 

<210> 807 
<211> 9 
<212> DNA 

<213> Artificial Sequence; 
<220> 

<223> synthetic construct 
<400> 807 

ggagagctt { 9 

<210> 808 ■ 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 808 
gctctcctc 



9 



<210> 809 . 

<211> 9 : !: 

W:212> DNA ) } ■ 

<213> Artificial Sequence 



<220> 

<22 3> synthetic construct 



<400> 809 
ggagagatt .' 



9 



<210> 810 
<211> 9 ■ J 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct. 
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<400> 810 
tctctcctc 



9 



<210> 811 
<211> 9 : 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 811 
aggctgctt 



9 



<210> 812 

<211> 9 

<212> DNA . 

<213> Artificial Sequence 

<220> ; 
<223> synthetic construct 

<400> 812 

gcagccttc .9 

<210> 813 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<210> 814 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 814 ; ' : 



<220> 

<223> synthetic construct 



<400> 813 
gagtgcgtt 



9 



cgcactctc 



9 



<210> 815 

<211> 9 

<212> DNA 

<213> Artificial Sequence 



<220> 

<2 23> synthetic construct 



<400> 815 
ccatccatt 



9 



<210> 816 
<211> 9 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223>; synthetic construct 

<400> 816 
tggatggtc 

<210> 817 

<211> 9 ^ : 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 817 
gctagtctt 

<210> 818 
<211> 9 
<212> DNA 

<213> Artificial iSequence 

<220> : 

<223> synthetic construct 

<400> 818 
gactagctc 

<210> 819 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 819 
aggctgatt 

<210> 820 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<220> :; ; ' 

<223> synthetic : construct 

i 

<400> 820 
tcagccttc 

<210> 821 

<211> 9 , 

<212> DNA ; 

<213> Artificial Sequence 

<220> j 

<223> synthetic construct 

<400> 821 ! i 
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9 



9 



9 ••• ■ •' 



9 
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a'cagacgtt 



<210> 822 
<211> 9 
<2 12 > DNA 

<213>, Artificial Sequence 



<220> 

<223> " synthetic construct 

<400> : 822 
cgtctgttc 

<210> 823 . 
<211> 9 
<212> DNA 

<213> ..Artificial Sequence 
<220> 

<223> synthetic construct 

<400> : 823 ; 
gagtgcctt 

<210> 824 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 824 
ggcactctc 

<210> 825 
<211> 9 
<212> DNA 

<213> : Artificial Sequence 
<220> 

<22 3> | synthetic construct 

<400>; 825 : 
acagacctt : 

<210>:826 
<211> 9 

<212> ; DNA , 
<2 13 > Artificial Sequence 

<220> 

<223> . synthetic construct 

<400>; 826 '' 
ggtctgttc 

<210>. 827 V 
<211> 9 



9 



<212> 
<213> 



DNA , 

Artificial Sequence 
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<220> 

<2 23> synthetic construct; 

<400> 827 
cgagctttt 

<210> 828 

<211> 9 i 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 828 
aagctcgtc 

<210> 829 
<211> ? 
<212> DNA 

<213> Artificial Sequence 

i 

<220> 

<223> synthetic construct; 

<400> 82:9 
ttagcggtt 

<210> 830 
<211> 9 
<212> DNA 

<213> Artificial Sequence^ 
<220> 

<223> synthetic construct 

<400> 830 ] 
ccgctaatc 

<210> 831 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<220> , 
<223> synthetic construct: 

<400> 831, : 
cctcttgtt 

<210> 832 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct. 

<400> 832 5 
caagaggtc,, 
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9 



9 



9 



9 



9 
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<210> 833 

<211> 9 < 
<212> DNA ; 

<213i> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 833 
ggtctcttt 

<210> 834 : : 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> : 

<2 23> synthetic construct 

<400> 834 
agagacctc 

<210> 835 i 
<211> 9 
<212> DNA . 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 835 
gccagattt ! 

<210> 836 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> : 

<223> synthetic construct 

<400> 836 ' 
atctggctc 

<210> 837 

<211> 9 ! i 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 837 
gagaccttt j 

<210> 838 
<211> 9 
<212> DNA . 

<213> Artificial Sequence 
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9 



9 



9 



9 
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■.'..<• 
<220> ;. j 

<2 23> synthetic construct 

<400> 838 
aggtctctc ; 

<210> 839 : j ■ 

<211> 9 : ' 

<212> DNA : I 

<213> Artificial Sequence 

<220> ; I 

<22 3> synthetic construct 

<400> 839 ' 
cacacagtt . 

<210> 840 ; : : 

<211> 9 
<212> DNA • 

<213> Artificial Sequence 

i 

<220> 

<22 3> synthetic construct 

<400> 840 
ctgtgtgtc 

<210> 841 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

<220> ; 

<223> synthetic construct 

<400> 841 
cctcttctt 

<210> 842 ! 
<211> 9 
<212> DNA . 

<213> Artificial -Sequence 

<220> 

<223> synthetic construct 

<400> 842 
gaagaggtc 

<210> 843 : } 
<211> 9 : : 

<212> DNA ■ 

<213> Artificial Sequence 

: ' ' j 

,<220> ; . ' : , 

<223> synthetic construct 

<400> 843 ! : 
tagagcgtt | 
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9 



9 



9 



9 



9 
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<210> : 844 . 
<211> 9 
<212> DNA ; 

<213> Artificial Sequence 

t ■ ■ "". • 

<220> 

<223> synthetic construct 

<400> 844 
cgctctatc ; • 

<210> 845 : 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic ^construct 

<400> 845 ; i 
gcacctttt 

<210> 846 : 
<211> 9 
<212> DNA ; 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 846 • 
aaggtgctc ■ 

<210> 847 ; 
<211> 9 
<212> DNA 

<213> Artificial Sequence 

■ j ■ 

<220> ; 

<223> synthetic -construct 

<400> 847 ; 
ggcttgttt '■ \ ■ 

<210> 84'8 ' .; 
<211> 9 | 
<212> DNA ; 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<40 0> 84 8 ' 
acaagcctc 

<210> 849 : 
<211> 9 
<212> DNA ; 

<213> Artificial Sequence 
<220> 
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9 



9 



9 



9 
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<223> synthetic construct 

<400> 849 
gacgcgatt : 

<210> 850 
<211> 9 
<2 12 > DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 850 
tcgcgtctc 

<210> 851 
<211> 9 
<212> DNA ■ 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 851 
cgagctgtt 

<210> 852 
<211> 9 . . 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 852 
cagctcgtc 

<210> 853 
<211> 9 
<212> DNA 

<213>: Artificial Sequence 

; j 

<220>,! : 
<223> synthetic construct 

<400>853 
tagagcctt 

<210> 854 | 
<211> 9 
<2 12 > ; DNA 

<213> Artificial Sequence 

<220> ; 

<223>; synthetic construct 

<400> 854 
ggctctatc ! 

<210> 855 
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9 



9 



9 



9 



9 
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<2ii> 9 ; i 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 855 
catccgttt 



9 



<210> 856 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 856 . 

acggatgtc 9 

<210> 857 
<211> 9 
<212> DNA ! 

<213> Artificial Sequence 
<220> 

<2 23> synthetic construct 
<400> 857 . 

ggtctcgtt 9 

<210> 858 
<211> 9 
<212> DNA ; 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 858 ; 
cgagacctc 



9 



<210> 859 . . 
<211> 9 ; ! : " 

<212> DNA 

<213> Artificial Sequence 



<220> 

<2 23> synthetic construct 



<400> 859 
gccagagtt 



9 



<210> 860 



<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> synthetic 



construct; 



<400> 860 

ctctggctc 9 

<210> 861 ■ 
<211> 9 
<212> DNA 

<213> Artificial Sequence. 
<220> 

<223> synthetic : construct 
<400> 861 

gagaccgtt 9 

<210> 862 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> : ; 

<223> synthetic construct 



<400> 862 
cggtctctc 



9 



<210> 863 ' .; 

<211> 9 ; 
<212> DNA . . '< ■ \ 
<213> Artificial Sequence 



<220> ; i 

<223> synthetic construct 



<400> 863 
cgagctatt 



9 



<210> 864 
<211> 9 : 
<212> DNA 



<213> Artificial Sequence 



<22;o> • : ;' 

<22 ; 3> synthetic construct 



<400> 864 
tagctcgtc 



9 



<210> 865 
. <211> 9 1 
<212> DNA 

<213> Artificial Sequence 



<220> , .' " J 

<223> synthetic construct 



<400> 865 
gcaagtgtt 



9 



<210> 866 



157/162 



WO 2005/058479 

<211> 9 ' 
<212> DNA : 

<213> Artificial Sequence 

<220> j ; 'i 

<223> synthetic construct 

<400> 866 j 
cacttgctc 

<210> 867 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 867 
ggtctcctt 

<210> 868 
<211> 9 

<212> DNA . .; 

<213> Artificial Sequence 

<220> j 

<223> synthetic construct 

<400> 868 . -v 

ggagacctc 

<210> 869 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 

<400> 869 
gccagactt 

<210> 870 t ..; .; 

<211> 9 I 

<212> DNA : .. i 

<213> Artificial Sequence 

<220> ! 

<22 3> synthetic construct 

<400> 870 : 
gtctggctc 

<210> 871 :j 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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9 



9 



9 
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<400> 871 
ggtctcatt 



9 



<210> 872: 
<211> .9 
: <212> DNA 
<213> Artificial Sequence 



<220> • 

<223> synthetic construct 



<400> 872 
tgagacctc 



9 



<210> 873 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

. <223 > synthetic construct 



<400> 873 
gagaccatt 



9 



<210> 874 



<211> 9 :.. 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<4 00> 874 

tggtctctc 9 

<210> 875 
. <211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

i<223> synthetic construct 
<400> 875 

ccttcagtt 9 
<210> 876 

<211> 9 i. 
<212> DNA 

<213> Artificial Sequence 



<220> 

•<223> synthetic construct 



. <400> 876 
ctgaaggtc 



9 



<210> 877 
<211> 9 
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<212> DNA 

<213> Artificial Sequence 
<220> . , 

<223> synthetic construct 

<400> 877 
gcacctgtt 

<210> 8J8 
<211> 9. 
; <212> DNA 
<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 878 
caggtgctc 

<210> 879 

<211> 9 

<212> DNA . 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 879 
aaaggcgtt 

<210> 880 
<211> 9 
<212> DNA 

<213> Artificial SBqu&nco 
<220> 

<223> synthetic construct 

<400> 880 
cgccttttc 

<210> 881 
<211> 9^ 1 ' : 
. <212> DNA 
<213> Artificial Sequence 

<220> 

<223> synthetic construct 

<400> 881 
cagatcgtt 

<210> 882 

<211> 9 • ; 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
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9 



9 



9 



9 
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<400> 882 
cgatctgtc 



<210> 883 
<211> 9 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic . construct 



<400> 883 
cataggctt 



9 



<210> 884 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 884 

gcctatgtc 9 

<210> 885 
<211> 9 
<212> DNA 



<210> 886 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> synthetic construct 
<400> 886 

gtgaaggtc 9 

<210> 887 
<211> 9 

<212> DNA ; : 

<213> Artificial Sequence 



<213> Artificial Sequence 



<220> 

<223> synthetic construct 



<400> 885 
ccttcactt 



9 



<220> 

<223> synthetic construct 



<400> 887 
gcacctctt 



9 



<210> 888 
<211> 9 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> synthetic construct 

<400> 888 
gaggtgctc 

<210> 889 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 889 ; 

cagaagacag acaagcttca cctgc 

<210> 890 [ 
<211> 27 
<212> DNA ; 

<213> Artificial Sequence 
<220> 

<223> synthetic construct 
<400> 890 

gcaggtgaag cttgtctgtc ttctgaa 
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9 



25 



162/162 



